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(57) Abstract: The present invention provides a vector or vector containing composition comprising a spumavirus backbone and a 
anugen-encodmg nucleic acid. The present invention also provides methods of treating or preventing a condition resulting from a 
vira^, bactenal. or parasitic infection in a subject comprising administering to the subject an effective amount of the vector or vector 
containing composition comprising a spumavirus backbone and an antigen-encoding nucleic acid. Also provided in the present 
invention are methods of treating a condition resulting from a cancer in a subject comprising administering to the subject an effective 
amount of the vector or vector containing composition comprising a spumavirus backbone and an antigen-encoding nucleic acid 
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LIVE REPLICATING SPUMA VIRUS VECTOR 
BACKGROUND OF THE INVENTION 

5 1 . Spumavirus, also known as foamy virus for the characteristics of 

vacuolization the virus induces in cell culture, belongs to a distinct group of 
retroviruses. The simian foamy viruses (SFVs) include isolates from Old World and 
New World monkeys and are classified into 10 different serotypes based on serological 
cross-reactivities. Virus appears to persist in the host for a long period of time in a 

10 latent form and can exist in the presence of neutralizing antibody. 

2. Currently the most studied retrovirus. Human Immunodeficiency Virus, is 
believed to be derived from nonhuman primate transmission into humans at some past 
time. Concerns about the risk of transmission of retroviruses from non-human primates 
to humans working in research laboratories were heightened in the early 1990's when 

1 5 two persons developed antibodies to SIV (Simian Immunodeficiency Virus) following 
work-related exposures, one of whom had clear evidence of persistent viral infection. 
(See CDC anonymous survey for simian immunodeficiency virus (SIV) seropositivity 
in SIV laboratory researchers - United States, 1992. MMWR Morb. Mort. Wkly. Rep. 
1992; 41:8 14-5; Khabbaz R.F., et al. Brief report: infection of a laboratory worker with 

20 simian inununodeficiency virus. New Eng. J. Med. 1994; 330:172-7; Khabbaz R.F., et 
al. Simian immunodeficiency virus needle stick accident in a laboratory worker. 
Lancet 1992; 340:271-3; and CDC. Guideline to prevent simian immunodeficiency 
virus infection in laboratory workers and animal handlers. MMWR 1988; 37:693-704.) 
In addition to SIV, nonhuman primate species used in biomedical research are 

25 commonly infected with SFV (simian foamy virus), STLV (simian t-cell lymphotrophic 
virus), and/or type D retroviruses. All of these retroviruses cause lifelong infections in 
nonhuman primates, and some are known to be transmissible through sexual contact, 
blood, or breast-feeding. Natural SFV infections in non-human primates have not been 
definitively associated with disease. In non-human primates, infection with the other 

30 retroviruses may result in a clinical spectrum ranging from asymptomatic infection to 
life threatening immunodeficiency syndromes or lymphoproliferative disorders. The 
transmission routes of SFVs among non-human primates remain undefined, but the 
prevalence of seroreactivity is high among captive adult non-human primates. 
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3. Recent publications indicate that earlier serological tests showing human 
spumavirus antibodies in the human populating were incorrect. Immunological 
investigation of a previously reported human spumavirus revealed that it shared 
common antigens in complement fixation, immunofluorescence and neutralization 

5 assays with the chimpanzee foamy virus, SFV-6. Furthermore, failure to detect 
serological evidence of HFV infection in people from a wide geographical area 
suggested that this virus isolate was a variant of SFV-6 particularly since sera from 
chimpanzees naturally infected with SFV-6 neutralized both viruses. In a survey for 
prevalence of human foamy virus in more than 5000 human sera, collected from 

10 geographically diverse populations, none of the serum samples were confirmed as 

positive. Taken together with sequence analysis endorsing the phylogenetic closeness 
of the purported human spumavirus to SFV-6/7, these data strongly suggest that human 
foamy virus is not naturally found in the human populations. (See Ali, M. et al., "No 
Evidence of Antibody to Human Foamy Virus in Widespread Human Populations," 

15 AIDS Research and Human Retroviruses, Vol. 12, NO. 15, 1996). 

4. Gene therapies have long looked for a good vector that can transport the 
foreign gene of choice into human cells. Thus, compositions and methods for gene 
therapy are needed that use a vector capable of carrying a significant amount of foreign 
DNA that will enter the host organism and not cause disease. 

20 5. Compositions and methods for vaccination using recombinant live 

retroviruses are also needed. A live virus, that causes no illness in humans, and that has 
genes of antigens of choice incorporated into its genome, would provide for an 
excellent vaccination tool as a vector. The retrovirus would reproduce in the human 
host and expose the immune system to antigens so that an immune response can be 

25 initiated. 

6. Targeted attack on reproducing cells is a goal of cancer treatment. What is 
needed are compositions and methods for cancer treatment that are specific for dividing 
cells that do not cause systemic damage to the cancer patient, A viral vector that could 
infect and kill dividing cells, without killing other cells of the host would provide a 
30 solution for cancer treatment. 
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7. The lack of any known disease associated with the virus from which the 
vector of the present invention was derived makes the present invention ideal for gene 
therapy regimens. 

SUMMARY OF THE INVENTION 

5 8. In accordance with the purposes of this invention, as embodied and broadly 

described herein, this invention, in one aspect, relates to live replicating retroviral 
vectors and methods of their use. 

9. Additional advantages of the invention will be set forth in part in the 
description which follows, and in part will be obvious from the description, or may be 

10 learned by practice of the invention. The advantages of the invention will be realized 
and attained by means of the elements and combinations particularly pointed out in the 
appended claims. It is to be understood that both the foregoing general description and 
the following detailed description are exemplary and explanatory only and are not 
restrictive of the invention, as claimed. 

15 10. The present invention provides compositions comprising live replicating 

retroviral vectors, wherein the vector is derived from a spumavirus, and wherein the ' 
vector further comprises a nucleic acid that encodes a non-spumavirus peptide, 
polypeptide, or protein. Thus, the present invention provides compositions comprising 
live replicating spumavirus vectors, 

20 11. Also provided by the present invention is a method of treating a subject with 

a condition, wherein the condition can be a viral infection, bacterial infection, parasitic 
infection, proliferative disorder (eg. cancer), or a condition associated with a genetic or 
autoimmune disorder; comprising administering to the subject a live replicating viral 
vector, wherein the immunizing construct is specific for the condition. 

25 12. Also provided by the present invention is a method of preventing a 

condition in a subject, wherein the condition can be a viral infection, bacterial infection, 
parasitic infection, proliferative disorder, or a condition associated with a genetic or 
autoimmune disorder; comprising administering to the subject a live replicating viral 
vector, wherein the antigen-encoding nucleic acid is specific for the condition. 

30 13. Also provided are methods of using the present vector for making models 

and using models to study diseases and potential treatments, as well as the models 
themselves. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

14. The accompanying drawings, which are incorporated in and constitute a part 
of this specification, illustrate several embodiments of the invention and together with 
the description, serve to explain the principles of the invention. 
5 15. Figure 1 shows a map of the pHSV vector detailing the location of the 

spumavirus env, pol, gag, and bel-l (las) genes. The map also indicates the presence of 
LTR flanking the coding region for the spumavirus genes and the presence and location 
of Apal and Notl restriction sites. 

16. Figure 2 shows a detailed restriction map of the pHSV vector. The 
10 sequence of the vector is provided in SEQ ID NO: 1. 

17. Figure 3 shows a linear map of pHSV with the pi 7/24 sequence 
incorporated in the vector to generate a pHSV-HIV-gag pl7/24 vector. This shows the 
location of pi 7/24 within the vector. 

18. Figure 4 shows that several TA-p 17/24 clones have the desired length 
15 fragment when cut with the Apal and Notl restriction enzymes. Shown is a 1% 

Acrylamide gel of purified clones resulting the transfection of pl7/24 into the TA 
cloning vector (pCR2. 1). Lane 1 is the marker BSTE II. Lane 2 is an empty pHSV 
vector used as a negative control. Lanes 3-7 are clones 1-5. 

19. Figure 5 shows that p24 expression can be measured by ELISA in clones 2, 
20 3, and 4 at days 2, 4, and 6 days post infection of BHK cells. Samples were also 

measured for p24 expression 6 days post-infection following freeze/thaw (F/T) of the 
clones. 

20. Figure 6 shows that p24 expression is pronounced 5 days post infection in 
clone 2, clone 3 and clone 4. 

25 21 . Figure 7 shows that expression levels of p24 are maintained at high levels 

even on subsequent passages. Second pass clone 2 is diluted 10-fold and dilution was 
used to infect BHK cells. Supematants were removed at 3, 7, and 10 days post 
infection and measured via ELISA for p24 expression. 

22. Figure 8 shows Western blot analysis of clone 2. Lane 1 is empty pHSV 

30 vector, lane 2 is mock infected BHK cells, and lane 3 is BHK cells infected with clone 
2. Samples were probed with LTFOOl, anti-p24, or anti-HIV serum and developed. 
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DETAILED DESCRIPTION 

22. The present invention may be understood more readily by reference to the 
following detailed description of preferred embodiments of the invention and the 
Examples included therein and to the Figures and their previous and following 

S description. 

23. Before the present compounds, compositions, and/or methods are disclosed 
and described, it is to be understood that this invention is not limited to specific 
synthetic methods or specific recombinant biotechnology methods unless otherwise 
specified, or to particular reagents unless otherwise specified, as such may, of course, 

10 vary. It is also to be understood that the terminology used herein is for the purpose of 
describing particular embodiments only and is not intended to be limiting. 
Definitions 

24. As used in the specification and the appended claims, the singular forms "a," 
"an" and "the" include plural referents unless the context clearly dictates otherwise. 

15 Thus, for example, reference to "a pharmaceutical carrier" includes mixtures of two or 
more such carriers, and the like. 

25. Ranges may be expressed herein as fi'om "about" one particular value, 
and/or to "about" another particular value. When such a range is expressed, another 
embodiment includes fi'om the one particular value and/or to the other particular value. 

20 Similarly, when values are expressed as approximations, by use of the antecedent 

"about," it will be understood that the particular value forms another embodiment. It 
will be fiirther understood that the endpoints of each of the ranges are significant both 
in relation to the other endpoint, and independently of the other endpoint. 

26. In this specification and in the claims which follow, reference will be made 
25 to a number of terms which shall be defined to have the following meanings: 

27. "Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where 
said event or circumstance occurs and instances where it does not. 

Vectors 

30 28. Disclosed are live replicating human spumavirus vectors suitable for human 

use comprising an immunizing construct, wherein the immunizing construct is inserted 
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in the bet gene. The disclosed immunizing construct can be an antigen-encoding 
nucleic acid. 

29. Where reference is made to "antigen"-encoding nucleic acid, it is 
understood diat in the context of the invention antigens encoded by the antigen- 

5 encoding nucleic acid can include but are not limited to immunogenic or non- 

immunogenic peptides, polypeptides, proteins, enzymes, cytokines. These antigens can 
be non-himian exogenous antigenic sequences from viruses, bacteria, or parasites. The 
antigens can also be antigenic endogenous human or human derived sequences from a 
condition such as a cancer. Also, peptides encoded by the antigen-encoding nucleic 
10 acid can include non-antigenic sequences for the purposes of gene therapy. 

30. Also disclosed are vectors of the invention, wherein the vector is pHSV or 
pHSV-GFP. Though at any one time reference may only be made to one of the vectors, 
it is an embodiment of the invention that both vectors may be used interchangeably and 
with equivalent results. Accordingly terms referring to either vector should be 

1 5 understood to refer to both vectors. Similarly, reference throughout this disclosure is 
given to the pHS V vector containing an HIV antigen-encoding nucleic acid. It is 
understood that this construct may be referred to as pFOV-gag, pFOV7-gag, pFOV-7- 
gag, pFOV7-pl7/24, pFOV-7-p 17/24, pFOV-pl7/24, pFOV-7GFP-gag, pFOV-7-GFP- 
gag, pFOV-7GFP-pl7/24, pFOV-7-GFP-p 17/24, pFOV-7 HIV-gag pl7/24, pHSV-gag, 

20 pHSV-pl7/24, pHSV-GFP-gag, pHSV-GFP-p 17/24 or pHSV-HIV-gag-p 17/24. A 
vector of the invention, can have the sequence of pHSV-HIV-GAG p 1 7/24. Thus, 
herein disclosed is a pHSV vector of the invention comprising SEQ ID NO: 1 , wherein 
an antigenic construct comprising the gag gene of HIV- 1 (SEQ ID NO: 2) has been 
inserted into the sequence at the unique Apa I site at 12816 and the unique Not I site at 

25 13552. Said insertion resulting in the excision of nucleotides 12817-13551 of the 

vector and therefore partially replacing the bet gene of pHSV with SEQ ID NO: 2 (SEQ 
ID NO: 8). Therefore, also disclosed are vectors of the invention, wherein the antigen- 
encoding nucleic acid is HTV-GAG pl7/p24. Herein, pl7/24 and p41 are used 
synonymously and are intended to refer to DNA encoding Gag or an antigenic fragment 

30 of Gag. It is also herein contemplated that other genes from HIV may be used as the 
antigen-encoding nucleic acid in the vector of the invention. Such genes can include 
the env andpo/ genes of HIV as well as vpu, vj/and nef genes. 
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31. Also disclosed are vectors of the invention, wherein the vector has the 
sequence of pHSV, which is defined by the restriction map shown in Figures 1, 2, and 
3. It is an embodiment of the present invention that modifications can be made to the 
pHSV vector to incorporate additional unique restriction sites throughout the vector 
5 including but not limited to within and around the bel-2(bet), pol, gag, env, or bel-1 
genes. It is also understood that such modifications may provide additional features 
including but not limited to reporter fiinction, resistance to a pharmaceutical agent, or 
enhancers.Related, but distinct vectors are disclosed in U.S. Patent No. 5,646,032 
which is incorporated in its entirety herein by reference. 

10 32. It is well-known in the art that vaccinations can be used prophylacticly for the 

prevention of infections as well as therapeutically for the treatment of ongoing 
conditions. Such infections or conditions can be but are not limited to viral infections. 
Thus, also disclosed are vectors of the invention, wherein the antigen-encoding nucleic 
acid is an antigen from a virus. The viral antigen can be selected fi-om the group of 

15 viruses consisting of Herpes simplex virus type-1, Herpes simplex virus type-2. 
Cytomegalovirus, Epstein-Barr virus. Varicella-zoster vims. Human herpesvims 6, 
Human herpesvims 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A vims, Hepatitis B vims. Hepatitis C virus. Hepatitis D vims. Hepatitis E 
virus, Rhinovirus, Coronavims, Influenza viras A, Influenza virus B, Measles vims, 

20 Polyomavims, Human Papilomavirus, Respiratory syncytial vims, Adenovims, 

Coxsackie virus. Dengue virus, Mumps vims, Poliovims, Rabies virus, Rous sarcoma 
vims, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever vims. Eastern 
Equine Encephalitis vims, Japanese Encephalitis virus, St. Louis Encephalitis vims, 
Murray Valley fever vims. West Nile vims, Rift Valley fever vims, Rotavims A, 

25 Rotavims B, Rotavims C, Sindbis vims, Simian Immunodeficiency cims. Human T- 

cell Leukemia vims type-1, Hantavims, Rubella vims, Simian Immunodeficiency vims. 
Human Inmiunodeficiency vims type-1, and Human Immunodeficiency vims type-2, 
and Simian Immunodeficiency vims (SIV). Also disclosed are vectors of the invention, 
wherein the antigen-encoding nucleic acid is SIV-GAG. The art is repleat with 

30 examples of viral antigens whose sequences and methods of obtaining them are well 
known. Thus, a pHSV vector expressing these antigens is within the scope of the 
present invention. 
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33. Vaccinations are also known for the prevention of bacterial infections. 
Additionally, anitbiotics are well-known in the art for the treatment of various bacterial 
infections. Herein contemplated and disclosed are vectors of the invention, wherein the 
antigen-encoding nucleic acid is an antigen from a bacterium. The bacterial antigen 

5 can be selected from the group consisting of M. tuberculosis, M bovis, M bovis strain 
BCG, BCG substrains, M avium, M intracellulare, M, africanum, M kansasii, M. 
marinum, M. ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroides, 
other Nocardia species, Legionella pneumophila^ other Legionella species, Salmonella 
typhi, other Salmonella species, Shigella species, Yersinia pestis, Pasteurella 

10 haemolytica, Pasteurella multocida, other Pasteurella species, Actinobacillus 

pleuropneumoniae. Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other 
Brucella species, Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia 
trachomatis. Chlamydia psittaci, Coxiella burnetii, other Rickettsial species. Ehrlichia 
species, Staphylococcus aureus. Staphylococcus epidermidis. Streptococcus pyogenes, 

15 Streptococcus agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, 

Campylobacter species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas 
aeruginosa, other Pseudomonas species, Haemophilus influenzae, Haemophilus 
ducreyi, other Hemophilus species, Clostridium tetani, other Clostridium 
species. Yersinia enterolitica, and other Yersinia species. The art is repleat with 

20 examples of bacterial antigens whose sequences and methods of obtaining them are 
well known. Thus, a pHSV vector expressing these antigens is within the scope of the 
present invention. 

34. The vectors of the invention are not limited to bacteria and viruses. Also 
disclosed are vectors of the invention, wherein the antigen-encoding nucleic acid is an 

25 antigen from a parasite. The parasitic antigen can be selected from the group consisting 
of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, Plasmodium 
malariae, other Plasmodium species., Trypanosoma brucei, Trypanosoma cruzi, 
Leishmania major, other Leishmania species., Schistosoma mansoni, other Schistosoma 
species., and Entamoeba histolytica. The art is repleat with examples of parasitic 

30 antigens whose sequences and methods of obtaining them are well known. Thus, a 
pHSV vector expressing these antigens is within the scope of the present invention. 
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35. The treatment of various forms of cancer is a major concern for millions of 
people worldwide and the focus of much of medical research. Herein contemplated are 
methods of treating a cancer comprising administering to a subject the vector of the 
invention. Therefore, also disclosed are vectors of the invention, wherein the antigen- 

5 encoding nucleic acid is a tumor antigen. The tumor antigen can be selected from the 
list consisting of human epithelial cell mucin (Muc-1; a 20 amino acid core repeat for 
Muc-1 glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha- 
ras oncogene product, p53, carcino-embryonic antigen (CEA), the raf oncogene 
product, gpl00/pmell7, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, 

10 GAGE, tyrosinase, gp75, Melan-A/Mart- 1 , gplOO, HER2/neu, EBV-LMP 1 & 2, HPV- 
F4, 6, 7, prostate-specific antigen (PSA), HPV-16, MUM, alpha-fetoprotein (AFP), 
CO 17-1 A, GA733, gp72, p53, the ras oncogene product, HPV E7, Wilm's tumor 
antigen- 1, telomerase, and melanoma gangliosides. Bach of these antigens is known 
and has a coding sequence that is publically available and well-known in the art. 

1 5 Further cancer antigens, whether later discovered or presently know can be expressed 
by the present pHSV vector as described herein. 

36. There are instances wherein it is advantageous to administer the vector of 
the invention in a pharmaceutical composition that comprises other vaccines. 
Pharamceutical compositions comprising multiple vaccines can be for therapeutic or 

20 prophylactic purposes. An example Examples of such a composition is other vaccines 
include the mumps, measles, rubella (MMR) vaccine, and vaccines against M. 
tuberculosis, M. bovis. M. bovis strain BCG, BCG substrains, M. avium, M. 
intracellulare, M. africanum, M. kansasii. M. marimim, M. ulcerans. M. avium 
subspecies paratuberculosis. Nocardia asteroides, other Nocardia species, Legionella 

25 pneumophila, other Legionella species. Salmonella typhi, other Salmonella species. 

Shigella species. Yersinia pestis. Pasteurella haemolytica, Pasteurella multocida. other 
Pasteurella species, Actinobacillus pleuropneimoniae. Listeria monocytogenes. 
Listeria ivanovii. Brucella abortus, other Brucella species, Cowdria ruminantium. 
Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia psittaci, Coxiella burnetii, 

30 other Rickettsial species. Ehrlichia species. Staphylococcus aureus. Staphylococcus 
epidermidis. Streptococcus pyogenes. Streptococcus agalactiae. Bacillus anthracis. 
Escherichia coli. Vibrio cholerae, Campylobacter species. Neiserria meningitidis. 
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Neiserria gonorrhea, Pseudomonas aeruginosa, other Pseudomonas species, 
Haemophilus influenzae, Haemophilus ducreyi, other Hemophilus species, Clostridium 
tetani, other Clostridium spcoies,Yersinia enterolitica, and other Yersinia species . 
Specifically contemplated and disclosed are pharmaceutical compositions comprising 
5 the vector of the invention and one or more additional vaccines. Also disclosed and 
herein contemplated are instances in which the vector of the invention comprises more 
than one antigen-encoding nucleic acid. In such a situation the vector of the invention 
will produce each antigen encoded in the vector as a separate antigen. 

37. There are instances in which a vector of the invention alone may not be 

10 suitable for a given purpose (eg., A kit designed to screen potential drugs for the 
treatment of a condition such kit, intended for use in laboratories without the 
capabilities to transfect a cell-line with the vector). In such cases, cells previously 
transfected with the vector of the invention are needed. Thus, also disclosed are cells 
comprising the vector of the invention. 

15 38. In an embodiment of the invention the antigen-encoding nucleic acid can 

encode a non-antigenic sequence of DNA. This sequence can provide a functional 
copy of a disrupted, mutated, disregulated or deleted gene. Examples of nucleic acids 
encoding proteins that play a role in genetic disorders are known in the literature 
relating to genetic disorders which is incorporated herein by reference. Methods of 

20 making these cells are described and exemplified herein and in the art. 

39. The ability to detect the presence of a construct can be a desireable feature 
of any vector. As such, vectors are often contain a marker to show that the construct of 
interest has been delivered to the subject (eg. a cell), and once delivered, is being 
expressed. A marker can take the form of a gene that is detectable when expressed. 

25 Thus, also disclosed are vectors of the invention further comprising a reporter gene. 
One example of a reporter gene is green fluorescence protein (GFP). 
Compositions 

40. The invention includes a composition comprising a vector of the invention. 
Disclosed are the components to be used to prepare the disclosed compositions as well 

30 as the compositions themselves to be used within the methods disclosed herein. These 
and other materials are disclosed herein, and it is understood that when combinations, 
subsets, interactions, groups, etc. of these materials are disclosed that while specific 
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reference of each various individual and collective combinations and permutation of 
these compounds may not be explicitly disclosed, each is specifically contemplated and 
described herein. For example, if a particular spumavirus vector or antigen-encoding 
nucleic acid (e.g., pl7, p24, pl7/24, gag, gpl20) is disclosed and discussed and a 
5 number of modifications that can be made to a number of molecules including the 
spumavirus vector or antigen-encoding nucleic acid (e.g., pi 7, p24, pi 7/24, gag, 
gpl20) are discussed, specifically contemplated is each and every combination and 
permutation of spumavirus vector and antigen-encoding nucleic acid (e.g., pi 7, p24, 
pi 7/24, gag, gpl20) and the modifications that are possible unless specifically 

10 indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as 

well as a class of molecules D, E, and F and an example of a combination molecule, A- 
D is disclosed, then even if each is not individually recited each is individually and 
collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, 
and C-F are considered disclosed. Likewise, any subset or combination of these is also 

15 disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered 
disclosed. This concept applies to all aspects of this application including, but not 
limited to, steps in methods of making and using the disclosed compositions. Thus, if 
there are a variety of additional steps that can be performed it is understood that each of 
these additional steps can be performed with any specific embodiment or combination 

20 of embodiments of the disclosed methods. Often therapeutic agents and vaccines are 
administered in formulations or combinations that incorporate other therapeutic 
modalities or necessary components for the purposes of time release, delivery, or 
augmentation of a response. It is understood and herein contemplated that the disclosed 
vectors and exogenous nucleic acid of the invention can be combined and administered 

25 with any such modality or component. 
Expression systems 

41. The nucleic acid vectors of the invention that are delivered to cells typically 
contain expression controlling systems for controlling the expression of 
heterologous/exogenous nucleic acid. For example, the inserted genes in viral and 

30 retroviral systems usually contain promoters, and/or enhancers to help control the 
expression of the desired gene product. A promoter is generally a sequence or 
sequences of DNA that fiinction when in a relatively fixed location in regard to the 
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transcription start site. A promoter contains core elements required for basic 
interaction of RNA polymerase and transcription factors, and may contain upstream 
elements and response elements. 

Viral Promoters and Enhancers 

5 42, Preferred promoters controlling transcription from vectors in manmialian 

host cells may be obtained from various sources, for example, the genomes of viruses 
such as: polyoma. Simian Virus 40 (SV40), adenovims, retroviruses, hepatitis-B virus, 
cytomegalovirus and most preferably spumavims, or from heterologous mammalian 
promoters, e.g. beta actin promoter. Each of these promoters in known and has a 

10 sequence that is publicly available. The early and late promoters of the SV40 virus are 
conveniently obtained as an SV40 restriction fragment which also contains the SV40 
viral origin of replication (Fiers et al.. Nature , 273: 1 13 (1978)). The immediate early 
promoter of the human cytomegalovirus is conveniently obtained as a Hindlll E 
restriction fragment (Greenway, P.J. et al., Gene 18: 355-360 (1982)), Of course, 

15 promoters from the host cell or related species also are useful herein. Methods of 
functionally linking promoters with coding sequences are well-known. 

43. Enhancer generally refers to a sequence of DNA that functions at no fixed 
distance from the transcription start site and can be either 5' (Laimins, L. et al,, Proc. 
Natl, Acad, Sci, 78: 993 (1981)) or 3* (Lusky, M,L., et al., Mol. Cell Bio. 3: 1 108 

20 (1983)) to the transcription unit. Furthermore, enhancers can be within an intron 

(Banerji, J.L, et al., Cell 33: 729 (1983)) as well as within the coding sequence itself 
(Osborne, T.F., et al., Mol. Cell Bio. 4: 1293 (1984)). They are usually between 10 
and 300 bp in length, and they function in cis. Enhancers function to increase 
transcription from nearby promoters. Enhancers also often contain response elements 

25 that mediate the regulation of transcription. Promoters can also contain response 

elements that mediate the regulation of transcription. Enhancers often determine the 
regulation of expression of a gene. While many enhancer sequences are now known 
from manrunalian genes (globin, elastase, albumin, -fetoprotein and insulin), typically 
one will use an enhancer from a eukaryotic cell virus for general expression. Preferred 

30 examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), 
the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers. 

12 
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44. The promotor and/or enhancer may be specifically activated either by light 
or specific chemical events which trigger their function. Systems can be regulated by 
reagents such as tetracycline and dexamethasone. There are also ways to enhance viral 
vector gene expression by exposure to irradiation, such as gamma irradiation, or 

S alkylating chemotherapy drugs. 

45. In certain embodiments the promoter and/or enhancer region can act as a 
constitutive promoter and/or enhancer to maximize expression of the region of the 
transcription unit to be transcribed. In certain constructs the promoter and/or enhancer 
region be active in all eukaryotic cell types, even if it is only expressed in a particular 

10 type of cell at a particular time. A preferred promoter of this type is the CMV promoter 
(650 bases). Other preferred promoters are SV40 promoters, cytomegalovirus (full 
length promoter), and retroviral vector LTR. 

46. It has been shown that all specific regulatory elements can be cloned and 
used to construct expression vectors that are selectively expressed in specific cell types 

1 5 such as melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been 
used to selectively express genes in cells of glial origin. 

47. Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human or nucleated cells) may also contain sequences necessary for the 
termination of transcription which may affect mRNA expression. These regions are 

20 transcribed as polyadenylated segments in the untranslated portion of the mRNA 

encoding tissue factor protein. The 3' untranslated regions also include transcription 
termination sites. It is preferred that the transcription unit also contain a 
polyadenylation region. One benefit of this region is that it increases the likelihood that 
the transcribed unit will be processed and transported like mRNA. The identification 

25 and use of polyadenylation signals in expression constructs is well established. It is 
preferred that homologous polyadenylation signals be used in the transgene constructs. 
In certain transcription units, the polyadenylation region is derived from the SV40 early 
polyadenylation signal and consists of about 400 bases. It is also preferred that the 
transcribed units contain other standard sequences alone or in combination with the 

30 above sequences improve expression from, or stability of, the construct. 
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Markers 

48. The viral vectors can include nucleic acid sequence encoding a marker 
product (reporter gene). This marker product is used to determine if the gene has been 
delivered to the cell and once delivered, is being expressed. Marker genes can be but 

5 are not limited to the E. Coli lacZ gene, which encodes B-galactosidase, and green 
fluorescent protein. 

49. In some embodiments the marker may be a selectable marker. Examples of 
suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), 
thymidine kinase, neomycin, neomycin analog G418, hydromycin, and puromycin. 

10 When such selectable markers are successfully transferred into a mammalian host cell, 
the transformed mammalian host cell can survive if placed under selective pressure. 
There are two widely used distinct categories of selective regimes. The first category is 
based on a cell's metabolism and the use of a mutant cell line which lacks the ability to 
grow independent of a supplemented media. Two examples are: CHODHFR- cells 

15 and mouse LTK- cells. These cells lack the ability to grow without the addition of such 
nutrients as thymidine or hypoxanthine. Because these cells lack certain genes 
necessary for a complete nucleotide synthesis pathway, they cannot survive unless the 
missing nucleotides are provided in a supplemented media. An alternative to 
supplementing the media is to introduce an intact DHFR or TK gene into cells lacking 

20 the respective genes, thus altering their growth requirements. Individual cells which 
were not transformed with the DHFR or TK gene will not be capable of survival in 
non-supplemented media, 

50. The second category is dominant selection which refers to a selection 
scheme used in any cell type and does not require the use of a mutant cell line. These 

25 schemes typically use a drug to arrest growth of a host cell. Those cells which have a 
novel gene would express a protein conveying drug resistance and would survive the 
selection. Examples of such dominant selection use the drugs neomycin, (Southern P. 
and Berg, P.. J. Molec. AddI. Genet. 1: 327 (1982)), mycophenolic acid, (Mulligan, 
R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., Mol. 

30 Cell. Biol. 5: 410-413 (1985)). The three examples employ bacterial genes under 
eukaryotic control to convey resistance to the appropriate drug G418 or neomycin 
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(geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the 
neomycin analog G418 and puramycin. 
Treatment and Prevention Methods 

51. By "treating" is meant an improvement in the disease state (i.e., viral 
5 infection, bacterial infection, parasitic infection, cancer, genetic disorder, or 

autoimmune disease) is observed and/or detected upon administration of a substance of 
the present invention to a subject. Treatment can range from a positive change in a 
symptom or symptoms of the disease to complete amelioration of the viral infection, 
bacterial infection, parasitic infection, or cancer (e.g., reduction in severity or intensity 

10 of disease, alteration of clinical parameters indicative of the subject's condition, relief 
of discomfort or increased or enhanced function), as detected by art-known techniques. 
The methods of the present invention can be utilized to treat an established viral 
infection, bacterial infection, parasitic infection, or cancer. One of skill in the art would 
recognize that viral infection, bacterial infection, parasitic infection, or cancer refer to 

15 conditions characterized by the presence of a foreign pathogen or abnormal cell growth. 
Clinical symptoms will depend on the particular condition and are easily recognizable 
by those skilled in the art of treating the specific condition. Treatment methods can 
include, but are not limited to therapeutic vaccinations. Thus, Disclosed are metiiods of 
treating a subject with a condition comprising administering to the vector of the 

20 invention. 

52. Also disclosed are methods of the invention, wherein the condition being 
treated is a viral infection. The viral infection can be selected from the list of vimses 
consisting of Herpes simplex virus type-1, Herpes simplex virus type-2, 
Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus, Human herpesvirus 6, 

25 Human herpesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus, Hepatitis C virus. Hepatitis D virus. Hepatitis E 
virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncytial virus, Adenovirus, 
Coxsackie virus. Dengue virus. Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 

30 virus. Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 

Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 
Murray Valley fever virus, West Nile virus. Rift Valley fever virus. Rotavirus A, 
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Rotavirus B, Rotavirus C, Sindbis virus. Simian Immunodeficiency cirus, Human T- 
cell Leukemia virus type-1, Hantavirus, Rubella virus. Simian Inunimodeficiency virus. 
Human Immunodeficiency virus type-1, and Human Immunodeficiency virus type-2. 

53. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a virus. The viral antigen can be selected from the 
group of virases consisting of Herpes simplex virus type-1, Herpes simplex virus type- 
2, Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus. Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus, Vesicular stomatitis virus, 
Hepatitis A virus. Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E 
virus, Rhino virus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus. Dengue virus. Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 
virus, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 
Murray Valley fever virus, West Nile vims, Rifl Valley fever virus. Rotavirus A, 
Rotavirus B, Rotavirus C, Sindbis virus, Simian Immunodeficiency cirus. Human T- 
cell Leukemia virus type-1, Hantavirus, Rubella virus. Simian Immunodeficiency virus. 
Human Immunodeficiency virus type-1, and Human Immunodeficiency vims type-2. 

54. Also disclosed are methods of the invention, wherein the condition being 
treated is a bacterial infection. The bacterial infection can be selected from the list of 
bacterium consisting of M tuberculosis, M bovis, M. bovis strain BCG, BCG 
substrains, M. avium, M intracellulare, M africanum, M. kansasii, M marinum, M 
ulcerans, M, avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila^ other Legionella species. Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 
Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii, Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaci, Coxiella burnetti, other Rickettsial species, Ehrlichia species, Staphylococcus 
aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 
agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
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Pseudomonas species, Haemophilus influenzae^ Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia 
enterolitica, and other Yersinia species. 

55. Also disclosed are methods of the invention, wherein the antigen-encoding 
5 nucleic acid is an antigen from a bacterium. The bacterial antigen can be selected from 

the group consisting of M. tuberculosis, M, bovis, M. bovis strain BCG, BCG 
substrains, Af. avium, M. intracellular^ M, africanum, M kansasii, M marinum, M, 
ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila, other Legionella species. Salmonella typhi, other 

10 Salmonella species. Shigella species. Yersinia pestis, Pasteur ella haemolytica, 

Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumonia^ 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaciy Coxiella burnetti, other Rickettsial species. Ehrlichia species, Staphylococcus 

15 aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 

agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia enterolitica, 

20 and other Yersinia species. 

56. Also disclosed are methods of the invention, wherein the condition being 
treated is a parasitic infection. The parasitic infection can be selected from the list of 
parasites consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, 
Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, Trypanosoma 

25 cruzi, Leishmania major, other Leishmania species.. Schistosoma mansoni, other 
Schistosoma species., and Entamoeba histolytica. 

SI. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a parasite. The parasitic antigen can be selected from 
the group consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 

30 vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 
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58. Also disclosed are methods of the invention, wherein the condition being 
treated is cancer. 

59. The disclosed vectors and vector containing compositions can be used to 
treat any disease where uncontrolled cellular proliferation occurs such as cancers. A 

5 non-limiting list of different types of cancers is as follows: lymphomas (Hodgkins and 
non-Hodgkins), leukemias, carcinomas, carcinomas of solid tissues, squamous cell 
carcinomas, adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, 
neuroblastomas, plasmacytomas, histiocytomas, melanomas, adenomas, hypoxic 
tumours, myelomas, AIDS-related lymphomas or sarcomas, metastatic cancers, or 
10 cancers in general. 

60. A representative but non-limiting list of cancers that the disclosed 
compositions can be used to treat is the following: lymphoma, B cell lymphoma, T cell 
lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, 
brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma 

15 of head and neck, kidney cancer, lung cancers such as small cell lung cancer and non- 
small cell lung cancer, neuroblastoma/glioblastoma, ovarian cancer, pancreatic cancer, 
prostate cancer, skin cancer, liver cancer, melanoma, squamous cell carcinomas of the 
mouth, throat, larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, 
breast cancer, and epithelial cancer, renal cancer, genitourinary cancer, pulmonary 

20 cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, 

hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, or 
pancreatic cancer. 

61. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is a tumor antigen. The tumor antigen can be selected from the list 

25 consisting of human epithelial cell mucin (Muc-1 ; a 20 amino acid core repeat for Muc- 
1 glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha-ras 
oncogene product, p53, carcino-embryonic antigen (CEA), the raf oncogene product, 
gplOO/pmein, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, 
tyrosinase, gp75, Melan-A/Mart-l, gplOO, HER2/neu, EBV-LMP 1 & 2, HPV.F4, 6, 7, 

30 prostate-specific antigen (PSA), HP V- 1 6, MUM, alpha-fetoprotein (AFP), CO 1 7- 1 A, 
GA733, gp72, p53, the ras oncogene product, HPV E7, Wilm's tumor antigen-1, 
telomerase, and melanoma gangliosides. 
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62. Disclosed are methods of treating a condition in a subject comprising 
administering to the subject the vector of the invention, wherein the condition is due to 
a mutated, disregulated, disrupted, or deleted gene; autoimmunity; or inflammatory 
diseases. 

5 63. Disclosed are methods of treating a condition in a subject, wherein the 

condition can be selected from list consisting of cystic fibrosis, asthma, multiple 
sclerosis, muscular dystrophy, diabetes, tay-sachs, spinobifida, cerebral palsy, 
Parkinson's disease, lou gehrigg disease, alzheimer's, systemic lupus erythamatosis, 
hemophelia, Addsion's disease, Cushing's disease. 

10 64. By "preventing" is meant that after administration of a substance of the 

present invention to a subject, the subject does not develop the symptoms of the viral, 
bacterial, or parasitic infection, and/or does not develop the viral, bacterial, or parasitic 
infection. "Preventing" or "prevention" can also refer to the ultimate reduction of an 
infection, condition, or symptoms of an infection, or condition relative to infections or 

15 conditions in subjects that do not receive the substance. Methods of prevention can 
include, but are not limited to profilactic vaccination. As such, disclosed are methods 
of preventing an infection in a subject comprising administering to the subject the 
vector of the invention. 

65. Also disclosed are methods of the invention, wherein the infection 

20 prevented is a viral infection. The viral infection can be selected from the list of 
viruses consisting of Herpes simplex virus type- 1, Herpes simplex virus type-2. 
Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus. Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus, Vesicular stomatitis virus. 
Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus. Hepatitis E 

25 virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus. Dengue virus, Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 
virus, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus, Eastem 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 

30 Murray Valley fever virus. West Nile virus. Rift Valley fever virus, Rotavirus A, 

Rotavirus B, Rotavirus C, Sindbis virus, Simian Immunodeficiency cirus. Human T- 
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cell Leukemia virus type-1, Hantavirus, Rubella virus, Simian Immvmodeficiency virus. 
Human Immunodeficiency virus type-1, and Human Immunodeficiency virus type-2. 

66. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a virus. The viral antigen can be selected from the 

5 group consisting of Herpes simplex virus type- 1 , Herpes simplex virus type-2, 

Cytomegalovirus, Epstein-Barr virus. Varicella-zoster vims. Human herpesvirus 6, 
Human herpesvims 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus, 
Hepatitis A virus, Hepatitis B virus, Hepatitis C virus. Hepatitis D virus, Hepatitis E 
virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 

10 Polyomavirus, Human Papilomavirus, Respiratory syncytial virus, Adenovirus, 

Coxsackie virus, Dengue virus. Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 
virus, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis virus, Japanese Encephalitis vims, St. Louis Encephalitis virus, 
Murray Valley fever vims. West Nile vims. Rift Valley fever vims, Rotavims A, 

15 Rotavirus B, Rotavims C, Sindbis vims. Simian Immunodeficiency cims. Human T- 
cell Leukemia vims type-1, Hantavims, Rubella vims. Simian Immunodeficiency vims. 
Human Immunodeficiency vims type-1, and Human Immunodeficiency vims type-2. 

67. Also disclosed are methods of the invention, wherein the infection 
prevented is a bacterial infection. The bacterial infection can be selected from the list 

20 of bacterium consisting of Af. tuberculosis, M bovis, M bovis strain BCG, BCG 

substrains, M avium, M, intracellulare, M, africanum, M kansasii, M marinum, M. 
ulcerans, M, avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila, other Legionella species. Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 

25 Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaci, Coxiella burnetti, other Rickettsial species. Ehrlichia species. Staphylococcus 
aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 

30 agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 

species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 

20 
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Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia 
enterolitica, and other Yersinia species. 

68. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a bacterium. The bacterial antigen can be selected from 
5 the group consisting of M tuberculosis, M. bovis, M bovis strain BCG, BCG 

substrains, M avium^ M intracellulare, M. africanum, M, kansasii, M. marinum, M 
ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila, other Legionella species, Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 

10 Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumonia^ 
Listeria monocytogenes. Listeria ivanovii, Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaci, Coxiella burnetti, other Rickettsial species. Ehrlichia species. Staphylococcus 
aureus. Staphylococcus epidermidis. Streptococcus pyogenes, Streptococcus 

15 agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter c 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia enterolitica, 
and other Yersinia species. 

20 69. Also disclosed are methods of the invention, wherein the infection 

prevented is a parasitic infection. The parasitic infection can be selected from the list 
of parasites consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 

25 mansoni, other Schistosoma species., and Entamoeba histolytica. 

70. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a parasite. The parasitic antigen can be selected from 
the group consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 

30 Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 
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71. Also disclosed are methods of the invention, wherein the subject is a horse, 
cow, pig, dog, car, mouse, monkey, human, or a cell isolated from such an animal. 
Delivery of the compositions to cells 

72. There are a number of compositions and methods which can be used to 

5 deliver nucleic acids to cells, either in vitro or in vivo. For example, the nucleic acids 
can be delivered through a number of direct delivery systems such as, electroporation, 
lipofection, calcium phosphate precipitation, and the present viral vectors. Appropriate 
means for transfection, including the present viral vectors, chemical transfectants, or 
physico-mechanical methods such as electroporation and direct diffusion of DNA, are 

10 described by, for example, Wolff, J. A., et al.. Science, 247, 1465-1468, (1990); and 

Wolff, J. A. Nature, 352, 815-818, (1991). Such methods are well known in the art and 
readily adaptable for use with the compositions and methods described herein. In 
certain cases, the methods will be modified to specifically function with large DNA 
molecules. Further, these methods can be used to target certain diseases and cell 

1 5 populations by using the targeting characteristics of the carrier. 

73. Transfer vectors can be any nucleotide construction used to deliver genes 
into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part 
of recombinant retrovirus or adenovims (Ram et al. Cancer Res. 53:83-88, (1993)). 

74. As used herein, viral vectors such as pHSV are agents that transport the 
20 disclosed antigen-encoding nucleic acids, such as pi 7/24 into the cell without 

degradation and include a promoter yielding expression of the gene in the cells into 
which it is delivered. Retroviral vectors especially spumavirus vectors are able to carry 
a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors. A 
preferred embodiment is a viral vector which has been engineered so as to induce the 
25 immune response of the host organism, elicited by the peptides encoded on the vector. 

75. Viral vectors can have higher transaction abilities (ability to introduce 
genes) than chemical or physical methods to introduce genes into cells. Typically, viral 
vectors contain, nonstructural and structural genes, a polymerase, inverted terminal 
repeats necessary for replication and encapsidation, and promoters to control the 

30 transcription and replication of the viral genome. When engineered as vectors, viruses 
typically have one or more of the early genes removed and a gene or gene/promotor 
cassette is inserted into the viral genome in place of the removed viral DNA. 
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Constructs of this type can carry large fragments of foreign genetic material. The 
necessary functions of the removed early genes are typically supplied by cell lines 
which have been engineered to express the gene products of the early genes in trans. 

76. Spumaviruses are retroviruses. A retrovirus is an animal virus belonging to 
the virus family of Retro viridae, including any types, subfamilies, genus, or tropisms. 
Retroviral vectors, in general, are described by Verma, LM., Retroviral vectors for gene 
transfer. In Microbiology-1985, American Society for Microbiology, pp. 229-232, 
Washington, (1985), which is incorporated by reference herein. Examples of methods 
for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,1 16 
and 4,980,286; PCX applications WO 90/02806 and WO 89/07136; and Mulligan, 
(Science 260:926-932 (1993)); the teachings of which are incorporated herein by 
reference. Although the present spumavirus vector is unique, the methods described for 
using other types of viral vectors can be useful in certain contexts. See for example 
U.S. Patent No. 5,646,032, which is incorporated herein for its teaching of those 
methods. 

77. A retrovirus is essentially a package which has packed into it nucleic acid 
cargo. The nucleic acid cargo carries with it a packaging signal, which ensures that the 
replicated daughter molecules will be efficiently packaged within the package coat. In 
addition to the package signal, there are a number of molecules which are needed in cis, 
for the replication, and packaging of the replicated virus. Typically a retroviral 
genome, contains the gag, pol, and env genes which are involved in the making of the 
protein coat. It is the gag, pol, and env genes which are typically replaced by the 
foreign DNA that is to be transferred to the target cell. Retrovirus vectors typically 
contain a packaging signal for incorporation into the package coat, a sequence which 
signals the start of the gag transcription unit, elements necessary for reverse 
transcription, including a primer binding site to bind the tRNA primer of reverse 
transcription, terminal repeat sequences that guide the switch of RNA strands during 
DNA synthesis, a purine rich sequence 5* to the 3' LTR that serves as the priming site 
for the synthesis of the second strand of DNA synthesis, and specific sequences near 
the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to 
insert into the host genome. The removal of the gag, pol, and env genes allows for 
large fragments of foreign sequence to be inserted into the viral genome, become 
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reverse transcribed, and upon replication be packaged into a new retroviral particle. 
This amount of nucleic acid is sufficient for the delivery of a one to many genes 
depending on the size of each transcript. It is preferable to include either positive or 
negative selectable markers along with other genes in the insert. 
5 78. A packaging cell line is a cell line which has been transfected or 

transformed with a retrovirus that contains the replication and packaging machinery, 
but lacks any packaging signal. When the vector carrying the DNA of choice is 
transfected into these cell lines, the vector containing the gene of interest is replicated 
and packaged into new retroviral particles, by the machinery provided in cis by the 
10 helper cell. The genomes for the machinery are not packaged because they lack the 
necessary signals. 

In vivo/ex vivo 

79. As described herein, the vector-containing compositions can be 
administered in a pharmaceutically acceptable carrier and can be delivered to the 

1 5 subjectDs cells in vivo and/or ex vivo by a variety of mechanisms well known in the art 
(e.g., uptake of naked DNA, liposome fusion, intramuscular injection of DNA via a 
gene gun, endocytosis and the like). 

80. If ex vivo methods are employed, cells or tissues can be removed and 
maintained outside the body according to standard protocols well known in the art. The 

20 compositions can be introduced into the cells via any gene transfer mechanism, such as, 
for example, calcium phosphate mediated gene delivery, electroporation, 
microinjection or proteoliposomes. The transduced cells can then be infused (e.g., in a 
pharmaceutically acceptable carrier) or homotopically transplanted back into the 
subject per standard methods for the cell or tissue type. Standard methods are known 

25 for transplantation or infusion of various cells into a subject. 
Nucleic Acid Delivery 

81. In the methods described above which include the administration and uptake 
of exogenous DNA into the cells of a subject (i.e., gene transduction or transfection), 
the nucleic acids of the present invention can be in the form of naked DNA or RNA, or 

30 the nucleic acids can be in a vector for delivering the nucleic acids to the cells, whereby 
the antibody-encoding DNA firagment is under the transcriptional regulation of a 
promoter, as would be well understood by one of ordinary skill in the art. 
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82. As one example, if the antigen-encoding nucleic acid of the invention is 
delivered to the cells of a subject in a spumavirus vector, the dosage for administration 
of spumavirus to humans can range from about 10^ to 10^ plaque forming units (pfu) 
per injection but can be as high as 10*^ to lO'^ pfu per injection. In some cases lower 
5 dosages (eg., 10^, 10"*, 10^, and 10^ pfu) can be effective. A subject can receive a single 
injection, or, if additional injections are necessary, they can be repeated at two, four, six 
month intervals (any intervening time intervals or other appropriate time intervals, as 
determined by the skilled practitioner) for an indefinite period and/or until the efficacy 
of the treatment has been established. 

10 83. Parenteral administration of the nucleic acid or vector of the present 

invention, if used, is generally characterized by injection. Injectables can be prepared 
in conventional forms, either as liquid solutions or suspensions, solid forms suitable for 
solution of suspension in liquid prior to injection, or as emulsions. A more recently 
revised approach for parenteral administration involves use of a slow release or 

15 sustained release system such that a constant dosage is maintained. See, e.g., U.S. 
Patent No. 3,610,795, which is incorporated by reference herein. For additional 
discussion of suitable formulations and various routes of administration of therapeutic 
compounds, see, e.g.. Remington: The Science and Practice of Pharmacy (19th ed.) ed. 
A.R. Gennaro, Mack Publishing Company, Easton, PA 1995. 

20 Delivery of pharmaceutical products 

84. The vector or vector-containing compositions can be administered in vivo in 
a pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is meant a 
material that is not biologically or otherwise undesirable, i.e., the material may be 
administered to a subject, along with the nucleic acid or vector, without causing any 

25 undesirable biological effects or interacting in a deleterious manner with any of the 
other components of the pharmaceutical composition in which it is contained. The 
carrier would naturally be selected to minimize any degradation of the active ingredient 
and to minimize any adverse side effects in the subject, as would be well known to one 
of skill in the art. 

30 85. The compositions may be administered orally, parenterally (e.g., 

intravenously), by intramuscular injection, by intraperitoneal injection, subcutaneously, 
transdermally, extracorporeally, topically, gene gun or the like, although topical 

25 
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intranasal administration or administration by inhalant is typically preferred. As used 
herein, "topical intranasal administration" means delivery of the compositions into the 
nose and nasal passages through one or both of the nares and can comprise delivery by 
a spraying mechanism or droplet mechanism, or through aerosolization of the nucleic 
5 acid or vector. The latter may be effective when a large number of animals is to be 
treated simultaneously. Administration of the compositions by inhalant can be through 
the nose or mouth via delivery by a spraying or droplet mechanism. Delivery can also 
be directly to any area of the respiratory system (e.g., lungs) via intubation. The exact 
amount of the compositions required will vary from subject to subject, depending on 

10 the species, age, weight and general condition of the subject, the severity of the allergic 
disorder being treated, the particular nucleic acid or vector used, its mode of 
administration and the like. Thus, it is not possible to specify an exact amount for 
every composition. However, an appropriate amount can be determined by one of 
ordinary skill in the art using only routine experimentation given the teachings herein. 

15 86. The materials may be in solution, suspension (for example, incorporated 

into microparticles, liposomes, or cells). These may be targeted to a particular cell type 
via antibodies, receptors, or receptor ligands. The following references are examples of 
the use of this technology to target specific proteins to tumor tissue (Senter, et al., 
Bioconiugate Chem.. 2:447-451, (1991); Bagshawe, K.D., Br. J. Cancen 60:275-281, 

20 (1989); Bagshawe, et al., Br. J. Cancen 58:700-703, (1988); Senter, et al., Bioconiugate 
Chem. . 4:3-9, (1993); Battelli, et al.. Cancer Immunol. Immunother. . 35:421-425, 
(1992); Pietersz and McKenzie. Immunolog. Reviews . 129:57-80, (1992); and Roffler, 
et al., Biochem. Pharmacol . 42:2062-2065, (1991)). Vehicles such as "stealth" and 
other antibody conjugated liposomes (including lipid mediated drug targeting to colonic 

25 carcinoma), receptor mediated targeting of DNA through cell specific ligands, 

lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting 
of murine glioma cells in vivo. The following references are examples of the use of this 
technology to target specific proteins to tumor tissue (Hughes et al.. Cancer Research . 
49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophvsica Acta. 

30 1 1 04: 1 79- 1 87, ( 1 992)). In general, receptors are involved in pathways of endocytosis, 
either constitutive or ligand induced. These receptors cluster in clathrin-coated pits, 
enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which 
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the receptors are sorted, and then either recycle to the cell surface, become stored 
intracellularly, or are degraded in lysosomes. The internalization pathways serve a 
variety of functions, such as nutrient uptake, removal of activated proteins, clearance of 
macromolecules, opportunistic entry of viruses and toxins, dissociation and degradation 
5 of ligand, and receptor-level regulation. Many receptors follow more than one 
intracellular pathway, depending on the cell type, receptor concentration, type of 
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms 
of receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and 
Cell Biology 10:6,399-409(1991)). 
10 Pharmaceutically Acceptable Carriers 

87. The compositions, can be used therapeutically in combination with a 
pharmaceutically acceptable carrier. 

88. Pharmaceutical carriers are known to those skilled in the art. These most 
typically would be standard carriers for administration of drugs to humans, including 

15 solutions such as sterile water, saline, and buffered solutions at physiological pH. The r 
compositions can be administered intramuscularly or subcutaneously. Other * 
compounds will be administered according to standard procedures used by those skilled 
in the art. 

89. Pharmaceutical compositions may include carriers, thickeners, diluents, 
20 buffers, preservatives, surface active agents and the like in addition to the molecule of 

choice. Pharmaceutical compositions may also include one or more active ingredients 
such as antimicrobial agents, anti-inflammatory agents, anesthetics, and the like. 

90. The pharmaceutical composition may be administered in a number of ways 
depending on whether local or systemic treatment is desired, and on the area to be treated. 

25 Administration may be topically (including ophthalmically, vaginally, rectally, 

intranasally), orally, by inhalation, or parenterally, for example by intravenous drip, 
subcutaneous, intraperitoneal or intramuscular injection. The disclosed compositions can 
be administered intravenously, intraperitoneally, intramuscularly, subcutaneously, 
intracavity, by gene gun, or transdermally. 

30 91 . Preparations for parenteral administration include sterile aqueous or non- 

aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are 
propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable 
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organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous 
solutions, emulsions or suspensions, including saline and bufifered media. Parenteral 
vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium 
chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient 
5 replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the 
like. Preservatives and other additives may also be present such as, for example, 
antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

92. Formulations for topical administration may include ointments, lotions, 
creams, gels, drops, suppositories, sprays, liquids and powders. Conventional 

10 pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be 
necessary or desirable. 

93. Compositions for oral administration include powders or granules, suspensions 
or solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, 
flavorings, diluents, emulsifiers, dispersing aids or binders may be desirable. 

15 94. Some of the compositions may potentially be administered as a 

pharmaceutically acceptable acid- or base- addition salt, formed by reaction with 
inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric 
acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids such as 
formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic 

20 acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or by reaction with an 
inorganic base such as sodium hydroxide, ammonium hydroxide, potassium hydroxide, 
and organic bases such as mono-, di-, trialkyl and aryl amines and substituted 
ethanolamines. 

Therapeutic Uses 

25 95. The substances of the present invention can be delivered at effective 

amounts or concentrations. An effective concentration or amount of a substance is one 
that results in treatment or prevention of the condition (e.g. HIV or AIDS). One skilled 
in the art would know how to determine an effective concentration or amount according 
to methods known in the art, as well as provided herein. One of skill in the art can 

30 utilize in vitro assays to optimize the in vivo dosage of a particular substance, including 
concentration and time course of administration. 
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96. Herein, immune modulatory substances are described. As defined immime 
modulation refers to any change in the immune response. This includes but is not 
limited to increase or decreases in the number of antigen specific plasma cells, memory 
B cells, memory T cells, activated CDS T cells, cytokine production, and cytolitic 

5 killing or the maintenance of homeostatic levels of the same when the responses would 
otherwise be changing. The dosage ranges for the administration of the substances are 
those large enough to produce the desired effect in which the symptoms of the disorder 
are affected. The dosage should not be so large as to cause adverse side effects, such as 
unwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage 
10 will vary with the age, condition, sex and extent of the disease in the patient and can be 
determined by one of skill in the art. The dosage can be adjusted by the individual 
physician in the event of any contraindications. Dosage can vary, and can be 
administered in one or more dose administrations daily, for one or several days. 

97. For example, to evaluate the efficacy of treatment of humans with a 

15 condition, such as for example, HIV, with a substance that modulates normal immune 
responses to HIV, the following studies can be performed. Patients with active 
infection can be selected. Drug efficacy can be monitored via viral titer, antibody 
ELISA or ELISPOT and CD4 T cell count. Patients can be randomized to two different 
protocols. In one protocol, subjects can remain on initial medication and in the second 

20 protocol, subjects can have their medication tapered after receiving the substance that 
modulates immune responses. 

98. In one embodiment, treatment can consist of either a single dosage of 0.3 
mg to 0.6 mg/animal of the vector expressing a substance that treats or prevents the 
condition. Additionally, dosage as low as 30)ig/animal to 300|xg/animal can be used 

25 with intramuscular injection of infecftious DNA, In one example, a DNA 

immunization comprising 0.3 mg of the pFOV-gag vector is administered to mouse via 
gene gun. After two weeks the subject is monitored using conventional assays to assess 
a generated immune response to pi 7/24 in the form of antibody production and antigen 
specific T cells. Having generated a response, the mouse is then challenged with an 

30 infectious dose of HIV. The mouse is then monitored for disease progression and 
clinical symptoms associated with HIV. Alternatively, a moxise can be infected with 
HIV and then given a therapeutic dose of the pFOV-gag. The animal is monitored for 
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changes in the viral lode, clinical progression of symptoms, as well as immune 
responses in comparison to non treated control animals. In a further example, a DNA 
immunization comprising O.OSmg (50ng) of the pFOV-gag vector is administered to 
mouse via intrauscular injection of infectious DNA. After two weeks the subject is 
5 monitored using conventional assays to assess a generated immune response to pi 7/24 
in the form of antibody production and antigen specific T cells. Having generated a 
response, the mouse is then challenged with an infectious dose of HIV. The mouse is 
then monitored for disease progression and clinical symptoms associated with HIV. 
Alternatively, a mouse can be infected with HIV and then given a therapeutic dose of 

10 the pFOV-gag. The animal is monitored for changes in the viral lode, clinical 

progression of symptoms, as well as immune responses in comparison to non treated 
control animals. The art of determining dosage for an animal based on size is well 
known. It is understood that a skilled artisan would be able to determine the proper 
dosage of a substance for an animal based on the dosage of the same substance 

1 5 administered to another animal of similar or different size. For example, wheras a 200g 
mouse would receive a 50|j.g dose of the immunization, a 1501b (68kg) human would 
be administered an immunization comprising about 17mg of the vector. 

99. Disclosed are methods of detecting the expression of the vector of the 
invention comprising using a first antibody to the antigen to measure protein expression 

20 in a quantitative or qualitative way, further comprising detecting the first antibody 

directly via a colorimetric measurement produced through the use of a substrate and a 
conjugated antibody or indirectly via a firstantibody to the antigen which in turn is 
bound by a second antibody which is conjugated and will result in a colorimetric 
measurement when combined with a substrate. 

25 100. Also disclosed are methods of the invention, wherein the antigen is 

detected by placing an aliquot of the vector of the invention in a lane on a gel and 
probing the gel for the antigen. 

101 . Disclosed are methods of detecting the expression of the vector of the 
invention comprising using a fluorescently labeled first antibody specific for the 

30 antigen and visualizing the antigen using a flow cytometer or fluorescence microscope. 

102. Also disclosed are methods of the invention, wherein the first antibody 
is not flourescently labeled, but a target for a second antibody with a fluorescent label. 
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103. Disclosed are methods of detecting the expression of the vector of claim 
the invention comprising using cj^olitic killing assay to assess activity. 

104. Disclosed are methods of detecting the vector of the invention further 
comprising obtaining a sample from a subject comprising a tissue biopsy or removal of 

5 blood or bone marrow. 
Screening Methods 

105. Also provided by the present invention is a method of screening a 
substance for effectiveness in treating or reducing the severity of the condition (e.g., 
HIV infection) comprising: a) obtaining an animal having the condition or 

10 characteristic (e.g., symptom) of the condition; b) administering the substance to an 

animal having one or more characteristics of the condition; and assaying the animal for 
an effect on the condition, thereby identifying a substance effective in reducing the 
condition. 

106. The ability of a substance to reduce the severity of a condition can be 
1 5 determined by evaluating the histological and clinical manifestations, of the animal 

witii condition before and after administration of the substance of interest and 
quantitating the amount of reduction of the condition. 

107. The animal in which the condition or characteristic (e.g., symptom) of 
the condition is produced can be any mammal and can include but is not limited to 

20 mouse, rat, guinea pig, hamster, rabbit, cat, dog, goat, monkey, and chimpanzee. The 
condition or characteristic (e.g., symptom) of the condition can be produced in the 
animal by any method known in the art. For example, HIV can be produced by 
introducing into the animal (eg., chimpanzee infected with HIV or rhesus macaques or 
nemestrina macaques infected with an HIV-l env on an SIV backbone. Pullium , JK, et. 

25 al., J. Infectious Dis. 183:1023, 2001) an infectious amount of HIV. 

108. The present invention also provides a method of screening for a 
substance effective in preventing the condition (e.g., HTV infection) comprising: a) 
administering the substance to an animal susceptible to the condition; b) subjecting the 
animal to treatment that will induce the condition or characteristic (e.g., symptom) of 

30 the condition; and c) assaying cells from the animal for an change in immune responses 
as compared to an the immune responses in a control animal having the condition in the 
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absence of the substance identifies a substance that is effective in preventing the 
condition. 

109. For example, the methods of measuring the amount of pi 7/24 or other 
HIV antigen in an animal include, but are not limited to, ELISA, PGR, FACS analysis, 

5 reverse-transcriptase-polymerase chain reaction and ELISPOT, Northern blots, 
Southern blots, and Western blots. 

110. A model for use in screening for substances effective in treating or 
preventing a disease comprising an animal capable of manifesting a characteristic of the 
disease is provided, wherein the animal has been administered the vector of the 

10 invention. 

111. A method of making the model of the invention comprising obtaining 
administering to an animal capable of manifesting a characteristic of the disease and 
administering to said animal the vector of claim la vector of the invention which 
encodes an antigen associate with the disease. 

15 1 12. A method of screening for a substance effective in treating a disease 

associated with an immunizing construct comprising: a) administering the substance to 
the model of the invention; and b) assaying for an change in the course of the disease as 
compared to an the course of the disease in a control subject; an improvement in the 
course of the disease in the presence of the substance identifies a substance that is 

20 effective in treating the disease. 

113. A method of screening for a substance effective in preventing a disease 
associated with an immunizing construct comprising: a) administering a the vector of 
the invention to a subject; b) subjecting the subject to treatment that will induce the 
disease or characteristic (e.g., symptom) of the disease; and c) assaying for an change 

25 in the course of the disease as compared to an the course of the disease in a control 

subject; an improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in preventing the disease. 

114. A method of screening for a substance effective in treating a disease 
associated with an immunizing construct comprising: a) subjecting the subject to 

30 treatment that will induce the disease or characteristic (e.g., symptom) of the disease; b) 
administering a the vector of the invention to a subject; and c) assaying for an change in 
the course of the disease as compared to an the course of the disease in a control 
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subject; an improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in treating the disease. 
Sequence similarities 

115. It is understood that as discussed herein the use of the terms homology 
5 and identity mean the same thing as similarity. Thus, for example, if the use of the 

word homology is used between two non-natural sequences it is understood that this is 
not necessarily indicating an evolutionary relationship between these two sequences, 
but rather is looking at the similarity or relatedness between their nucleic acid 
sequences. Many of the methods for determining homology between two 
10 evolutionarily related molecules are routinely applied to any two or more nucleic acids 
or proteins for the purpose of measuring sequence similarity regardless of whether they 
are evolutionarily related or not. 

116. In general, it is understood that one way to define any known variants 
and derivatives or those that might arise, of the disclosed genes and proteins herein, is 

1 5 through defining the variants and derivatives in terms of homology to specific known 
sequences. This identity of particular sequences disclosed herein is also discussed 
elsewhere herein. In general, variants of genes and proteins herein disclosed typically 
have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 
88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence 

20 or the native sequence. Those of skill in the art readily understand how to determine 
the homology of two proteins or nucleic acids, such as genes. For example, the 
homology can be calculated after aligning the two sequences so that the homology is at 
its highest level. 

1 17. Another way of calculating homology can be performed by published 
25 algorithms. Optimal alignment of sequences for comparison may be conducted by the 

local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 
(1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. 
Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms 
30 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection. 
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118. The same types of homology can be obtained for nucleic acids by for 
example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et aL 
Proc, Natl Acad. ScL USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 
183:281-306, 1989 which are herein incorporated by reference for at least material 

S related to nucleic acid alignment. It is understood that any of the methods typically can 
be used and that in certain instances the results of these various methods may differ, but 
the skilled artisan understands if identity is found with at least one of these methods, 
the sequences would be said to have the stated identity, and be disclosed herein. 

119. For example, as used herein, a sequence recited as having a particular 
10 percent homology to another sequence refers to sequences that have the recited 

homology as calculated by any one or more of the calculation methods described 
above. For example, a first sequence has 80 percent homology, as defined herein, to a 
second sequence if the first sequence is calculated to have 80 percent homology to the 
second sequence using the Zuker calculation method even if the first sequence does not 

15 have 80 percent homology to the second sequence as calculated by any of the other 

calculation methods. As another example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 
percent homology to the second sequence using both the Zuker calculation method and 
the Pearson and Lipman calculation method even if the first sequence does not have 80 

20 percent homology to the second sequence as calculated by the Smith and Waterman 
calculation method, the Needleman and Wunsch calculation method, the Jaeger 
calculation methods, or any of the other calculation methods. As yet another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the 
first sequence is calculated to have 80 percent homology to the second sequence using 

25 each of calculation methods (although, in practice, the different calculation methods 
will often result in different calculated homology percentages). 
Nucleic acids 

120. There are a variety of molecules disclosed herein that are nucleic acid 
based, including for example the nucleic acids that encode, for example HIV-1 gag, as 

30 well as various functional nucleic acids. The disclosed nucleic acids are made up of for 
example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting 
examples of these and other molecules are discussed herein. It is understood that for 
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example, when a vector is expressed in a cell, that the expressed mRNA will typically 
be made up of A, C, G, and U. Likewise, it is understood that if, for example, an 
antisense molecule is introduced into a cell or cell environment through for example 
exogenous delivery, it is advantageous that the antisense molecule be made up of 
5 nucleotide analogs that reduce the degradation of the antisense molecule in the cellular 
environment. 

Sequences 

121. There are a variety of sequences related to the gag gene that are publicly 
available (eg., Genbank Accession Number: L03707), these sequences and others are 

10 herein incorporated by reference in their entireties as well as for individual 
subsequences contained therein. 

122. One particular sequence set forth in gag that is publicly available and 
having Genbank accession number L03707 is used herein, as an example, to exemplify 
the disclosed compositions and methods. It is understood that the description related to 

15 this sequence is applicable to any sequence related to gag unless specifically indicated 
otherwise. Those of skill in the art understand how to resolve sequence discrepancies 
and differences and to adjust the compositions and methods relating to a particular 
sequence to other related sequences (i.e. sequences of gag). Primers and/or probes can 
be designed for any gag sequence given the information disclosed herein and known in 

20 the art. 

Peptides/ Protein variants 

123. There are numerous variants of protein antigens that are antigenic. For 
example, there are variants of the gag protein and protein that are known and herein 
contemplated. In addition to the known functional HIV-1 gag strain variants there are 

25 derivatives of the gag proteins which also function in the disclosed methods and 

compositions. Protein variants and derivatives are well understood to those of skill in 
the art and in can involve amino acid sequence modifications. For example, amino acid 
sequence modifications typically fall into one or more of three classes: substitutional, 
insertional or deletional variants. Insertions include amino and/or carboxyl terminal 

30 fusions as well as intrasequence insertions of single or multiple amino acid residues. 

Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal 
fusions, for example, on the order of one to four residues. Immunogenic fusion protein 
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derivatives, such as those described in the examples, are made by fusing a polypeptide 
sufficiently large to confer immunogenicity to the target sequence by cross-linking in 
vitro or by recombinant cell culture transformed with DNA encoding the fusion. 
Deletions are characterized by the removal of one or more amino acid residues from the 
5 protein sequence* Typically, no more than about from 2 to 6 residues are deleted at any 
one site within the protein molecule. These variants ordinarily are prepared by site 
specific mutagenesis of nucleotides in the DNA encoding the protein, thereby 
producing DNA encoding the variant, and thereafter expressing the DNA in 
recombinant cell culture. Techniques for making substitution mutations at 

10 predetermined sites in DNA having a known sequence are well known, for example 

Ml 3 primer mutagenesis and PGR mutagenesis. Amino acid substitutions are typically 
of single residues, but can occur at a number of different locations at once; insertions 
usually will be on the order of about from 1 to 10 amino acid residues; and deletions 
will range about from 1 to 30 residues. Deletions or insertions preferably are made in 

15 adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. Substitutions, 
deletions, insertions or any combination thereof may be combined to arrive at a final 
construct. The mutations must not place the sequence out of reading frame and 
preferably will not create complementary regions that could produce secondary mRNA 
structure. Substitutional variants are those in which at least one residue has been 

20 removed and a different residue inserted in its place. Such substitutions generally are 
made in accordance with the following Tables 1 and 2 and are referred to as 
conservative substitutions. 
TABLE 1 : Amino Acid Abbreviations 



Amino Acid 


Abbreviations 


Alanine 


Ala, A 


Aliosoleucine 


Alle 


Arginine 


Arg,R 


Asparagine 


Asn, N 


aspartic acid 


Asp, D 


Cysteine 


Cys,C 


glutamic acid 


Glu,E 


Glutamine 


Gln,0 


Glycine 


Gly,G 


Histidine 


His,H 


Isolelucine 


He, I 


Leucine 


Leu, L 



36 



wo 2004/003153 



•CTAJS2003/020325 



Amino Acid 


Abbreviations 


Lysine 


Lys, K 


Phenylalanine 


Phe, F 


Proline 


Pro, P 


pyroglutamic acidp 


Glu 


Serine 


Ser, S 


Threonine 


Thr, T 


Tyrosine 


Tyr.Y 


Tryptophan 


Trp, W 


Valine 


Val. V 



TABLE 2:Amino Acid Substitutions 

Original Residue Exemplary Conservative Substitutions, others are known 

in the art. 

Ala; Ser 

Arg; Lys, gin 

Asn; Gin; his 

Asp; Glu 

Cys; Ser 

Gin; Asn, lys 

Glu; Asp 

Gly; Pro 

His; Asn; gin 

He; Leu; val 

Leu; He; val 

Lys; Arg; gin; 

Met; Leu; ile 

Phe; Met; leu; tyr 

Ser; Thr 

Thr; Ser 

Trp; Tyr 

Tyr; Trp; phe 

Val; lie; leu 

124. Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those in Table 2, i.e., selecting 
5 residues that differ more significantly in their effect on maintaining (a) the structure of 
the polypeptide backbone in the area of the substitution, for example as a sheet or 
helical conformation, (b) the charge or hydrophobicity of the molecule at the target site 
or (c) the bulk of the side chain. The substitutions which in general are expected to 
produce the greatest changes in the protein properties will be those in which (a) a 
10 hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
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residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side 
chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative 
residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
5 phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in 
this case, (e) by increasing the number of sites for sulfation and/or glycosylation, 

125. The replacement of one amino acid residue with another that is 
biologically and/or chemically similar is known to those skilled in the art as a 
conservative substitution. For example, a conservative substitution would be replacing 

10 one hydrophobic residue for another, or one polar residue for another. The 

substitutions include combinations such as, for example, Gly, Ala; Val, He, Leu; Asp, 
Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively substituted 
variations of each explicitly disclosed sequence are included within the antigenic 
polypeptides encoded and expressed by the vectors provided herein. 

15 126. Substitutional or deletional mutagenesis can be employed to insert sites 

for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of 
cysteine or other labile residues also may be desirable. Deletions or substitutions of 
potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the 
basic residues or substituting one by glutaminyl or histidyl residues. 

20 127. Certain post-translational derivatizations are the result of the action of 

recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl 
residues are frequently post-translationally deamidated to the corresponding glutamyl 
and asparyl residues. Alternatively, these residues are deamidated under mildly acidic 
conditions. Other post-translational modifications include hydroxylation of proline and 

25 lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of 
the o-amino groups of lysine, arginine, and histidine side chains (T.E. Creighton, 
Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 
79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of 
the C-terminal carboxyl. 

30 128. It is understood that one way to define the variants and derivatives of the 

disclosed proteins herein is through defining the variants and derivatives in terms of 
homology/identity to specific known sequences. For example, SEQ ID NO:3 sets forth 



38 



wo 2004/003153 ^rCT/US2003/020325 



a particular sequence of a gag protein. Specifically disclosed are variants of these and 
other proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% 
or 95% homology to the stated sequence. It is understood that those of skill in the art 
will recognize variants of a disclosed protein as being a variant. Particularly, there are 
5 numerous naturally occurring variants of viral (e.g., HIV) antigens. For example, it is 
understood that those of skill in the art would recognize that the particular gag protein 
disclosed in SEQ ID NO: 4 is a gag protein from a different HIV-1 isolate from the 
HIV-1 gag protein in SEQ ID NO: 3 and therefore a variant that can be used with 
equivalent results. Those of skill in the art readily understand how to determine the 

10 homology of two proteins. For example, the homology can be calculated after aligning 
the two sequences so that the homology is at its highest level. 

129. Another way of calculating homology can be performed by published 
algorithms. Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by 

15 the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 
(1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. 
Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection. 

20 130. The same types of homology can be obtained for nucleic acids by for 

example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. 
Proc, Natl Acad, Set USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol 
183:281-306, 1989 which are herein incorporated by reference for at least material 
related to nucleic acid alignment. 

25 131. It is understood that the description of conservative mutations and 

homology can be combined together in any combination, such as embodiments that 
have at least 70% homology to a particular sequence wherein the variants are 
conservative mutations. 

132. As this specification discusses various proteins and protein sequences it 

30 is understood that the nucleic acids that can encode those protein sequences are also 
disclosed. This would include all degenerate sequences related to a specific protein 
sequence, i.e. all nucleic acids having a sequence that encodes one particular protein 
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sequence as well as all nucleic acids, including degenerate nucleic acids, encoding the 
disclosed variants and derivatives of the protein sequences. Thus, while each particular 
nucleic acid sequence may not be written out herein, it is understood that each and 
every sequence is in fact disclosed and described herein through the disclosed protein 
5 sequence. For example, one of the many nucleic acid sequences that can encode the 
protein sequence set forth in SEQ ID NO: 3 is set forth in SEQ ID NO: 2. In addition, 
for example, a disclosed conservative derivative of SEQ ID NO: 4 is shown in SEQ ID 
NO: 5, where the isoleucine (I) at position 19 is changed to a valine (V). It is 
understood that for this mutation all of the nucleic acid sequences that encode this 

10 particular derivative of the gag gene of the LAI strain of HIV- 1 are also disclosed. It is 
also understood that while no amino acid sequence indicates what particular DNA 
sequence encodes that protein within an organism, where particular variants of a 
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes 
that protein in the particular gag from which that protein arises is also known and 

1 S herein disclosed and described. 
Computer readable mediums 

133. It is understood that the disclosed nucleic acids and proteins can be 
represented as a sequence consisting of the nucleotides of amino acids. There are a 
variety of ways to display these sequences, for example the nucleotide guanosine can 

20 be represented by G or g. Likewise the amino acid valine can be represented by Val or 
V. Those of skill in the art understand how to display and express any nucleic acid or 
protein sequence in any of the variety of ways that exist, each of which is considered 
herein disclosed. Specifically contemplated herein is the display of these sequences on 
computer readable mediums, such as, commercially available floppy disks, tapes, chips, 

25 hard drives, compact disks, and video disks, or other computer readable mediums. 

Also disclosed are the binary code representations of the disclosed sequences. Those of 
skill in the art understand what computer readable mediums. Thus, computer readable 
mediums on which the nucleic acids or protein sequences are recorded, stored, or 
saved. 
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Methods of making the compositions 

134. The compositions disclosed herein and the compositions necessary to 
perform the disclosed methods can be made using any method known to those of skill 
in the art for that particular reagent or compound unless otherwise specifically noted. 

5 Nucleic acid synthesis 

135. For example, the nucleic acids, such as, the oligonucleotides to be used 
as primers can be made using standard chemical synthesis methods or can be produced 
using enzymatic methods or any other known method. Such methods can range from 
standard enzymatic digestion followed by nucleotide fragment isolation (see for 

10 example, Sambrook et aL, Molecular Cloning: A Laboratory Manual, 2nd Edition 

(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) 
to purely synthetic methods, for example, by the cyanoethyl phosphoramidite method 
using a Milligen or Beckman System IPlus DNA synthesizer (for example. Model 
8700 automated synthesizer of Milligen-Biosearch, Burlington, MA or ABI Model 

15 380B). Synthetic methods usefril for making oligonucleotides are also described by 
Ikuta et al, Ann. Rev, Biochem. 53:323-356 (1984), (phosphotriester and phosphite- 
triester methods), and Narang et a/,, Methods EnzymoL, 65:610-620 (1980), 
(phosphotriester method). Protein nucleic acid molecules can be made using known 
methods such as those described by Nielsen et al., Bioconjug, Chem. 5:3-7 (1994). 

20 136. Disclosed are cells produced by the process of transforming the cell with 

any of the disclosed nucleic acids. Disclosed are cells produced by the process of 
transforming the cell with any of the non-naturally occurring disclosed nucleic acids. 

137. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed nucleic acids. Disclosed are any of the non-naturally 

25 occurring disclosed peptides produced by the process of expressing any of the disclosed 
nucleic acids. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed non-natural nucleic acids. 

138. Disclosed are animals produced by the process of transfecting a cell 
within the animal with any of the nucleic acid molecules disclosed herein. Disclosed 

30 are animals produced by the process of transfecting a cell within the animal with any of 
the nucleic acid molecules disclosed herein, wherein the animal is a mammal. Also 
disclosed are animals produced by the process of transfecting a cell within the animal 
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with any of the nucleic acid molecules disclosed herein, wherein the mammal is mouse, 
rat, rabbit, cow, sheep, pig, or primate. 

139. Also disclose are animals produced by the process of adding to the 
animal any of the cells disclosed herein. 

S Methods of using the compositions as research tools 

140. The disclosed vectors, compositions and methods can be used for 
targeted gene disruption and modification in any animal that can undergo these events. 
Gene modification and gene disruption refer to the methods, techniques, and 
compositions involved in the selective removal or alteration of a gene or stretch of 

10 chromosome in an animal, such as a mammal, in a way that propagates the 

modification through the germ line of the mammal. In general, a cell is transformed 
with a vector which is designed to homologously recombine with a region of a 
particular chromosome contained within the cell, for example, as described herein. 
This homologous recombination event can produce a chromosome which has 

1 5 exogenous DNA introduced, for example in frame, with the surrounding DNA. This 
type of protocol allows for very specific mutations, such as point mutations, to be 
introduced into the genome contained within the cell. Methods for performing this type 
of homologous recombination are disclosed herein. 

141 . One of the preferred characteristics of performing homologous 
20 recombination in mammalian cells is that the cells should be able to be cultured, 

because the desired recombination events occur at a low frequency. 

142. Once the cell is produced through the methods described herein, an 
animal can be produced from this cell through either stem cell technology or cloning 
technology. For example, if the cell into which the nucleic acid was transfected was a 

25 stem cell for the organism, then this cell, after transfection and culturing, can be used to 
produce an organism which will contain the gene modification or disruption in germ 
line cells, which can in turn be used to produce another animal that possesses the gene 
modification or disruption in all of its cells. In other methods for production of an 
animal containing the gene modification or disruption in all of its cells, cloning 

30 technologies can be used. These technologies generally take the nucleus of the 

transfected cell and either through frision or replacement fuse the transfected nucleus 
with an oocyte which can then be manipulated to produce an animal. The advantage of 
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procedures that use cloning instead of ES technology is that cells other than ES cells 
can be transfected. For example, a fibroblast cell, which is very easy to culture can be 
used as the cell which is transfected and has a gene modification or disruption event 
take place, and then cells derived from this cell can be used to clone a whole animal. 
5 143. Throughout this application, various publications are referenced. The 

disclosures of these publications in their entireties are hereby incorporated by reference 
into this application in order to more fully describe the state of the art to which this 
invention pertains. The references disclosed are also individually and specifically 
incorporated by reference herein for the material contained in them that is discussed in 
10 the sentence or section in which the reference is cited. 

144. It will be apparent to those skilled in the art that various modifications 
and variations can be made in the present invention without departing from the scope or 
spirit of the invention. Other embodiments of the invention will be apparent to those 
skilled in the art from consideration of the specification and practice of the invention 

15 disclosed herein. It is intended that the specification and examples be considered as 
exemplary only, with a true scope and spirit of the invention being indicated by the 
following claims. 
Examples 

145. The following examples are put forth so as to provide those of ordinary 
20 skill in the art with a complete disclosure and description of how the vectors, 

compounds, compositions, and/or methods claimed herein are made and evaluated, and 
are intended to be purely exemplary of the invention and are not intended to limit the 
scope of what the inventors regard as their invention. Efforts have been made to ensure 
accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors 
25 and deviations should be accounted for. Unless indicated otherwise, parts are parts by 
weight, temperature is in or is at ambient temperature, and pressure is at or near 
atmospheric. 
Example 1 

146. In order to obtain the pl7/p24 fragment of the fflV-l gag gene, pSX 
30 plasmid containing the gag gene was subjected to 30 cycles of PCR using forward 

Apal-pl7 5' primer and reverse Notl-p24 3* primer. PCR product was cloned into a 
TA cloning vector (pCR2.1 Invitrogen; Carisbad, CA) and expanded. Fig. 1 shows a 
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map of the empty pHSV vector. The vector contains the viral envelope gene env as 
well as its structural gene gag and viral polymerase gene pol. The vector also possesses 
a transactivator (TAS) gene bel-1 and 2 LTR that flank the coding region of the vector. 
An inspection of the vector SEQ (SEQ ID NO: 1) shows where various restriction sites 
5 are located on the vector. A complete listing of the sites is found in Fig. 2 and these 
sites are shown in SEQ ID NO: 1. 

147. Inspection of the restriction map revealed Apal and Noil unique 
restriction sites located around the BET (Bel-2) gene at 13522 and 12816 respectively. 
As such, the BET gene was chosen as the site for integration of the exogenous nucleic 

10 acid (Figure 3). The pi 7/p24 were excised from the TA cloning vector using Apal and 
Not] restriction enzymes. Similarly, the empty pHSV vector was cut with Apal and 
Not]. 

148. The pl7/p24 segment of gag (789 (start of pl7) - 1876 (end of p24)) 
was ligated to pHS V and the pHSV-GAG was transformed into E. coli via 

1 5 electroporation. The bacteria was plated onto high Amp plates with the pHSV 

conferring ampR between the LTRs on the approximate 3kb plamid. Seven colonies 
were chosen and checked for insertion and proper orientation. Figure 4 shows that 
clone 2, 3, 4 when cut with Apal and Not] possessed a 1 kb fragment. This indicates 
that the pl7/p24 fragment is intact and properly oriented. 

20 149. Knowing that the construct was integrated into the vector, the next 

objective was to examine expression levels of the antigen. Integration is meaningless if 
the desired protein is not expressed. To rest expression of pl7/p24, the loaded pHSV 
vector was transfected using lipofectamine into BHK cells and supematants were 
collected at various time points post transfection. ELISA testing for p24 expression 

25 was conducted on the collected supematants (Fig. 5). 

1 50. Following transfection, clone 2 had clearly measurable p24 expression. 
The p24 expression level for clone 2 was increased at each time point after day 2, 
clones 3 and 4 showed no measurable expression of p24 until day 5 post transfection. 
Subsequent Elisa data revealed that 5 days post transfection clones 2, 3, and 4 could 

30 produce as much as 500 ng/ml, lOOOng/ml, and 800ng/ml of p24 respectively (Fig. 6). 
At the same time in vitro cytopathology was assessed. Cells from the day 2 cultures 
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had very little CPE. However, by day 5 CPE was readily observed. The presence of 
CPE shows that the p24 activity was associated with the replicating viral vector. 

151. Fig. 7 shows p24 expression in clone 2 after 2"^ pass, Supematants were 
collected at 3, 7 and 10 days after infection with diluted virus. At day 3 no p24 is 

5 observed. However, by day 7 lOOOng/ml of p24 is observed at the 10"^ dilution, which 
is confirmed at the 10"^ dilution with a measurement of lOOng/ml. By day 10, the 10"^ 
to 10"^ dilutions all had approximately 200ng/ml of p24. As such, day 7 undiluted virus 
would yield 10^ ng/ml p24 and at day 10 the secretion level ranges from 2x10"* to 2x10*^ 
ng/ml of p24. This is an extremely high expression level and indicates likely success in 
10 vivo as this vector is resulting in expression comparable to the highest 25% of vectors, 

152. As a verification that p24 Elisa data was actually measuring pl7/p24 
(p41) and not just p24, western blotting was performed (Fig. 8). Supematants from 
BHK cells infected with clone 2 pHSV-gag, empty pHSV, or mock infected (infected 
with PBS) were run on a polyacrylamide gel and probed for p24, HIV, or LTFOOl as a 

15 positive control. LTFOOl antibody bound to all 3 lanes. However, bands were 

apparent only in the lane for clone 2 when either anti-p24 or anti-HIV was used as a 
probe. The resulting bands show the smallest band to be 41kd, which corresponds to 
the size of pl7/p24. 

153. Histological observations were made from in vitro culture of pHSV in 
20 BHK cells. Cells infected with pHSV-gag showed CPE; however, this CPE is not 

usually observed in vivo. 
Methods 

Cloning and PCR 

154. The pi 7 and p24 segments of the LAI strain of HIV- 1 gag were cloned 
25 into a pHSV spumavirus vector. Briefly, pSX plasmid containing the LAI strain HIC-1 

gag gene was obtained from the HIV repository. The pSX plasmid was used as a 
template for PCR amplification and cloning of pl7 and p24, A forward Apal-pl7 5' 
primer TCC GGG CCC GGA ATG CCT ATA GTC CAG AAC ATC C (SEQ ID NO: 
6) and a reverse Notl-p24 3' primer GCG GCC GCG TTT TGA GAA CGA AAT 
30 ACC GG (SEQ ID NO: 7) were used to amplify p4 1 . The amplified segment was 
cloned into a TA cloning vector (pCR2.1; Invitrogen Carlsbad, CA) and expanded. 
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Restriction digest of the TA-p41 vector with Apal and Notl removed pi 7/24 from the 
vector. The p41 segment was then inserted into pHSV within the bet (bel'2) gene. 
Transformation 

155. The pHSV vector and pi 7/24 antigen were ligated together with ligase 
5 for 16hrs at 4C. Following ligation, pHSV-gag was electroporated into E. coli and 

incubated for Ihr at 37C in 2xyT broth media. The samples are then plated on high 
Ampicillin plates and incubated overnight at 37C. Colonies growing on the high Amp 
plates were selected and transferred to 2xyT broth media overnight at 37C. Minipreps 
were performed on the overnight cultures. Plasmids were cut with Apal and Notl and 
1 0 aliquots were run on 1 % agarose gel. 

ELISA and Western Blot 

156. ELISA and Western blot were perfomied using techniques commonly 
used in the art. 

Example 2 

15 157. Animal subjects can be used to screen the effectiveness of a pHSV 

vector and an antigen-encoding nucleic acid. Additionally, animal subjects may be 
used to study the vector-antigen combination's ability to prevent or treat a condition. 
The vector can also be used to induce a condition in an animal that is associate with a 
disease, and that animal can then be used to study the disease/condition or to study 

20 potential treatments for the disease/condition. Such conditions can be infections 
resulting from viruses, bacteria, or parasites; autoimmime reactions including 
inflammatory diseases, asthma, systemic lupus erythamatosis, muscular dystrophy, or 
multiple sclerosis, diabetes, tay-sachs, spinobifida, cerebral palsy, parkinson's disease, 
lou gehrigg disease, alzheimer's, hemophelia, Addsion's disease, Cushing's disease; or 

25 cancer. Animals can include but are not limited to mice, rats, pig, dog, monkey, 
chimpanzee, and human. 

158. Mice are injected with a sub-immunizing dose of pHSV-gag. O.Smg/ 
animal of pHSV-gag are administered via gene gun. This same immunizing dose may 
be administered by an alternative route such as intramuscular, intravenous, 

30 intraperitoneal, intracranial, or subcutaneous. Mice are then monitored and starting at 
15 days post immunization mice are bled every three days to day 30 post immunization. 
Blood samples are collected for serum extraction and for the isolation of peripheral 
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blood lymphocytes (PBL). Serum samples can be used immediately for ELISA assays 
to look for antibody to the antigen encoded in the vector or for cytokine increases 
following immunization. Samples can also be frozen and stored at -80C for testing at a 
later time. 

5 1 59. Lypmhocytes are removed from whole blood and diluted in media (e.g. 

RPMI 1640, DMEM, MEM, or EMEM) with 10% serum. PBL are then stained for 
antigen specific T cells using antigen specific MHC class I and MHC class II tetramers 
to be visualized via flow cytometry. Simultaneously or separately PBL are stained with 
surface markers to look at activation characteristics of the cells and other surface 

10 phenotypes. Some examples of the antibodies used are CD3, CD4, CDS, CDl la (LFA- 
1), CD43 (IBl 1), CD44, CD45RA, CD45RO, CD62L (L-selectin), CD69 and Bcl-2. 
Additionally, PBL can be stained for the production of intracellular cytokines such as 
IFN-Y, IL-4, TNF-a, IL-2, and IL-10 following stimulation with antigen. Lymphocytes 
can also be used in an ELISPOT assay for cytokine production following stimulation 

1 5 with specific antigen or CTL assay to assess killing activity. 

160. All of the stains and assays allow for the assessment of cell-mediated 
responses to the priming antigen. Tetramer staining allows for the enumeration of 
antigen specific T cells of a known specificity. This coupled with surface markers 
details the activation state of the antigen specific cells. Tetramer positive T cells 

20 (CD8+/CD3+, CD3+, or CD3+/CD4+) possess various surface markers that reveal the 
activation state of the cells. By multiplying percentages obtained from the analysis of 
the staining by the number of cells, the exact number of cells exhibiting any 
characteristic can be obtained. Exposure to an antigen creates cells that express high 
levels of CDl la, and CD44. Once these levels are increased on a cell, they will not 

25 decrease, and in conjunction with tetramer stains and other surface markers can lead to 
the identification of memory T cells. CD43 and CD69 both increase during activation, 
but decrease following clearance of the priming antigen. L-selecting will decrease on 
the surface of activated cells and increase over time to naive levels. BcU2 also provides 
a measure of the activation level of cells as activated cells will have decreased Bcl-2 

30 levels relative to naive T cells indicating a susceptibility to apoptosis whereas memory 
cells will have increased levels of Bcl-2. 
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161. ELISPOT and intracellular cytokine stains allow the enumeration of 
cytokine secreting antigen-specific cells following stimulation. This can be paired with 
the number of tetramer positive cells to determine what portion, if not all, of the T cells 
are functional. 

5 162. CTL assays determine the ability for antigen specific T cells to kill cells 

expressing the antigen. The assay reads on the cytolitic activity of T cells as 
determined by release of a chemiluminescent marker or radioactive material in labeled 
target cells. 

163. Like the cell-mediated responses, humoral responses may be measured 
10 in harvested PBL. PBL maybe stained for anti-mouse IgG, CD45R (B220), FN A, and 

CD138 (syndecan-1) to look at the activation of B cells or the number of plasma cells. 
Plasma cells, the effector arm of the humoral immune system are syndecan-lH-, and 
B220-. Memory B cells will be IgG+, B220+, and PNA+, the presence of PNA 
positive cells indicates transition through a germinal center reaction. Additionally, 
15 PBL may be used to enumerate the number of antigen specific plasma cells on a plasma 
cell elispot or the number of antigen specific B cells on a memory B cell assay. 

164. ELISA data can be combined with the humoral immunity data from the 
PBL to assess the effect of the immunization on the mouse. 

165. Mice that have measurable responses can be boosted and responses 

20 monitored. However, direct challenge is necessary as it indicates the ability to mount a 
protective immune response. Mice that do not have an immune response or mice that 
have an immune response that but need to be boosted can receive second and third 
immunizations by the same or different route than the priming immunization over the 
next few months. 

25 166. As mice cannot be infected with a lentivirus, monkeys (e.g., rhesus 

macaques) are used for challenge experiments. Monkeys can receive a similar 
construct (pHSV-SIV-gag) which produces similar levels of gag in vitro. Monkeys are 
inoculated with the pHSV-SIV-gag DNA, and after several weeks, they can be 
challenged with an infectious SIV. The animals can then be observed for protection 

30 from infection or disease. 

167. For a challenge, monkeys are given an infectioxis dose of an antigen (e.g. 
SrV) or stimulated in such a way as to induce a condition to be prevented. Challenged 
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monkeys can be assessed in the same way as immunized mice with additional assays 
used to look at antigen load following challenge and gross pathological and histological 
assessments made from tissue biopsy or necropsy. 

168. Monkeys can also be given an infectious dose of an antigen (e.g. SIV) or 
5 stimulated in such a way as to create a condition to be treated. Once the condition is 

established, monkeys are given a therapeutic dose of the immunizing antigen and 
monitored as to the effects on the condition. A successful therapy will not necessarily 
clear the condition or infection, but may slow or stop progression. The same assays to 
those used for prophylactic inununization are used to characterize the responses 
10 Methods 

Flow cytometry 

169. Cells are suspended at a concentration that is appropriate according to 
the antibody manufacturers instructions in FACS buffer (2% FCS (or BSA) in PBS 
(0.2% NaNa can also be added)). Antibodies (CD3, CD4, CD8, CDl la (LFA-1), CD43 

15 (IBl 1), CD44, CD45R (B220), CD45RA, CD45RO, CD62L (L-selectin), CD69, 
CD 138, PNA, anti-mouse IgG) are added to the cells according to the manufacturers 
instructions and incubated for 30min at 4C in the dark. Cells are washed 3X in FACS 
Buffer. 1 wash comprises centrifuging the cells at 800rpm for 3min to pellet the cells, 
removing the media, and resuspending the cells. After the third wash, cells are 

20 resuspended in 2% PFA in PBS. Staining is analyzed on a FACSCalibur flow 
cytometry instrument (Beckton-Dickenson) or other suitable cytometer. 
Intracellular cytokine staining* 

170. Cells are stimulated for 5hrs in the presence of antigen and BFA 
(meninsen may be substituted for BFA) at 37C (this stimulation is not needed for Bcl-2 

25 staining). After the 5hr incubation, cells are centrifuged and the media removed and 
resuspended in FACS buffer (2% FCS (or BSA) in PBS (0.2% NaNa can also be 
added)). Antibodies (CD3, CD4, CD8, CDl la (LFA-1), CD43 (IBl 1), CD44, CD45R 
(B220), CD45RA, CD45RO, CD62L (L-selectin), CD69, CD 138, PNA, anti-mouse 
IgG) are added to the cells according to the manufacturers instructions and incubated 

30 for 30min at 4C in the dark. Cells are washed 3X in FACS Buffer. 1 wash comprises 
centrifuging the cells at SOOrpm for 3min to pellet the cells, removing the media, and 
resuspending the cells. After the third wash, cells are resuspended in cytofix/cytoperm 
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solution for 20min at 4C. Cells are then washed 3X in a permwash solution and 
resuspended at a concentration that is appropriate according to the antibody 
manufacturers instructions in permwash. Antibodies (IFN-y, TNF-a, IL-2, IL-4, IL-10, 
and Bcl-2) are added to the cells according to the manufacturers instructions and 
incubated for 30min at 4C in the dark. Cells are washed 3X in Perm Wash and 2X in 
FACS Buffer. After the second FACS Buffer wash, cells are resuspended in 2% PFA 
in PBS. Staining is analyzed on a FACSCalibur flow cytometry instrument (Beckton- 
Dickenson) or other suitable cytometer. Intracellular kits are commercial available 
through (Beckton-Dickenson). 
ELISA 

171. 96-well ELISA plates are coated with antigen overnight at 4C. Plates 
are blocked with a suitable blocking media containing 10% PCS for l-2hrs at RT. 
Plates are resuspended in a known volume of an ELISA diluent (e.g., PBS+10% FCS) 
and serum is added to the plate and incubated at RT for l.Shrs. Plates are washed with 
PBS-K).l%Tween 3X and lOOtil of antibodies (e.g., HRPO conjugated anti-mouse IgG) 
diluted 1 : 1000 in ELISA diluent is added to each well. Plates are incubated for 1 .5hrs 
at RT and then washed 3x with PBS+0. l%Tween. Plates are then coated with 100|j.l of 
a chromagen substrate (e.g., o-phenyldiamine + 3% H2O2 in citrate buffer (Sodium 
citrate in H2O pH=5) and incubated for Ihr in the dark. The reaction is stopped by 
adding lOO^il of IN HCl. Plates are read in a ELISA plate reader. 

Plasma cell ELISPOT 

172. 96-well filter (ELISPOT) plates are coated with antigen overnight. 
Plates are washed IX in PBS-0.1%Tween and 3X in PBS. Plate are blocked l-2hrs 
with media + 10% FCS. After blocking, media is removed and lOOjil of media are 
added to each well. Effector cells (cells containing lymphocytes fi'om an immunized 
animal (e.g., PBL, splenocytes, hepatocytes, and bone marrow) are added to the plate 
and serial dilutions are made. Plates are incubated for Shrs at 37C. After the Shr 
incubation, plates are washed 3X in PBS and 3X in PBS+0. 1% Tween. Plates are then 
coated with 100|il PBS+0.1% Tween+1% FCS and biotinylated Ab at 1/100 
concentration overnight at 4C. After incubation plates are washed 4X in 

PBS+0. l%Tween and incubated for Ihr at RT in PBS+0. l%Tween+l%FCS containing 
HRP-avidinD at 1/1000. Plates are washed 3X with PBS and a suitable chromagen 
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substrate is added (e.g., AEC). Plates are allowed to develop for 8-15min and the 
reaction is stopped with washing the plates with water. Spots are visualized and 
counted using a dissection microscope or ELISPOT plate reader. 
Cytokine ELISPOT 

5 173. The day prior to the assay coat each well with 50 ^l 10 |ig/ml purified 

anti-citokine antibody (e.g., IFN-y, TNF-a, IL-2, IL-4, IL-10). Incubate overnight at 4 
**C. Wash plates 4 times with 200 |il PBS. Block wells with 200 jil 10%FCS complete 
medium, incubate at room temperature for at least 1 hour. Add effector cells to the top 
wells and make serial dilutions. Discard 100 i^l of cells from the last dilution. Add the 

0 stimulating antigen and IL-2 to the wells. Incubate plates at 37 C, 5% CO2 for 16-40 
hours (use level to ensure that plates stay even horizontally). Discard cells and wash 5 
times with PBS-Tween). Add 100 |il 2 ^xg/ml biotinylated-anti-cytokine antibody (e.g., 
IFN-Y, TNF-a, IL-2, IL-4, IL-10) diluted in PBS-Tween. Incubate at room temperature 
for 2 hours or overnight at 4C. Wash plate 5 times with PBS-Tween, 200^1 each wash. 

5 Add 100 jil 1:1000 Strepavidin -HRP, incubate 1 hour at room temperature, (avoid 
high background by not exceeding 1 hour incubation). Wash wells 5 times with PBS- 
Tween. Add 100 |al chromagen substrate per well, incubate at room temperature for 
10-20 minutes in dark (at 10 minutes, observe the color developed, if the color is not 
fully intense, incubate few more minutes and double check). Wash plates under 

0 running tap water and air-dry plates (in hood with light tumed-off). Count and record 
numbers of spots/well by ELISPOT plate reader or stereomicroscope (20x). 
CTL 

174. Direct ex-vivo cytolytic killing is measured in a 5hr ^'Cr release assay. 
Briefly, target cells are labeled with 51Cr for Ihr at 37C. Following the Ihr incubation, 

5 cells are centrifuged for 8min at 1200rpm to pellet the cells. The media is poured off 
and the pellet resuspended in fresh media. Cells are centrifuged and resuspended two 
additional times. After the third spin, cells are resuspended in an appropriate amount of 
media to attain the desired effector to target ration (e.g., a 50: 1 effector: target ratio 
with effector cells at a concentration of 1x10^/ ml would require target cells to be at a 

0 concentration of 2xlOVml), and contacted with antigen. Effector cells (cells containing 
lymphocytes from an immunized animal (e.g., PBL, splenocytes, hepatocytes, and bone 
manrow) are contacted with the target cells for 5-6hrs at 37C. Cells are pelletted by 
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centrifugation and supematants harvested. Cr release is measured on an appropriate 
instrument. Percent killing is (experimental release-spontaneous release / maximum 
release — spontaneous release) x 100. 

175. Secondary killing to measure memory cell responses to antigen can also 
5 be measured. In this method, effector cells are incubated for 6 days in the presence of 

antigen stimulation and then used in a cytolytic assay as described above. 
Example 3 

1 76. In accordance with the methods described above the pHS V vector comprising an 
SIV-Gag-pl7/p27 construct was made. This vector is useful to establish a model for the study of 

10 the effectiveness of a particular treatment or prophylactic vaccine. For example, such a model can 
comprise a system to establish a foamy live viral vector for vaccine use. The pHSV with an SIV 
gag (pl7,p27) insert has been tested in vitro and shown to express similar levels of SIV gag as the 
HIV gag engineered vector. This SIVgag live viral vector can be used to inoculate rhesus macaques 
in order to determine protective immune responses that can develop following in vivo expression of 

1 5 the vector gene products including the SIV gag protein. Inoculated animals can then be challenged 
with wild type SIV to further determine any potential vaccine induced efficacy by studying primary 
(sterilizing immunity) and secondary (time to morbidity) end points. 
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CLAIMS 

What is claimed: 

1. A live replicating human spumavirus vector (pHSV) suitable for human use 
comprising an immunizing construct, wherein the immunizing construct partially 
replaces the bet gene. 

2. The vector of claim 1, wherein the vector has the sequence of SEQ ID NO:l. 

3. The vector of claim 1, further comprising a reporter gene. 

4. The vector of claim 3, wherein the reporting gene is GFP. 

5. The vector of claim 1, wherein the immunizing construct is an antigen encoding 
nucleic acid. 

6. The vector of claim 5, wherein the antigen-encoding nucleic acid encodes a viral 
antigen. 

7. The vector of claim 6, wherein the viral antigen is an antigen from a virus selected 
from the group consisting of Herpes simplex virus type-1, Herpes simplex virus type-2, 
Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus. Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus, 
Hepatitis A virus, Hepatitis B virus. Hepatitis C virus. Hepatitis D virus, Hepatitis E 
virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus, Dengue virus, Mumps virus, Poliovinis, Rabies virus, Rous sarcoma 
virus. Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus, Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 
Murray Valley fever virus. West Nile virus, Rift Valley fever virus, Rotavirus A, 
Rotavirus B, Rotavirus C, Sindbis virus. Simian Immunodeficiency cirus. Human T- 
cell Leukemia virus type-1. Hantavirus, Rubella virus, Simian Immunodeficiency virus, 
Human Immunodeficiency virus type-1, and Human Immunodeficiency virus type-2. 

8. The vector of claim 7, wherein the viral antigen is HIV-GAG pl7/p24. 

9. The vector of claim 7, wherein the immunizing construct is SIV-GAG. 

10. The vector of claim 5, wherein the antigen-encoding nucleic acid encodes a 
bacterial antigen. 

1 1 . The vector of claim 10, wherein the bacterial antigen is an antigen from a 
bacterium selected from the group consisting of M tuberculosis, M. bovis, M. bovis 
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Strain BCG, BCG substrains, M avium, M. intracellulare^ M qfricanum, M. kansasii, 
Af. marinum, M ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroideSy 
other Nocardia species, Legionella pneumophila, other Legionella species, Salmonella 
typhi, other Salmonella species. Shigella species. Yersinia pestis, Pasteurella 
haemolytica, Pasteurella multocida, other Pasteurella species, Actinobacillus 
pleuropneumonias Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other 
Brucella species, Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia 
trachomatis. Chlamydia psittaci, Coxiella burnetii, oihQv Rickettsial species. Ehrlichia 
species, Staphylococcus aureus. Staphylococcus epidermidis. Streptococcus pyogenes. 
Streptococcus agalactiae, Bacillus anthracis, Escherichia coli, Vibrio cholerae, 
Campylobacter species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas 
aeruginosa, other Pseudomonas species, Haemophilus influenzae, Haemophilus 
ducreyi, other Hemophilus species, Clostridium tetani, other Clostridium 
species. Yersinia enterolitica, and other Yersinia species, 

12, The vector of claim 5, wherein the peptide encoding nucleic acid encodes a 
parasitic antigen. 

13, The vector of claim 12, wherein the parasitic antigen is an antigen from a parasite 
selected from the group consisting of Toxoplasma gondii, Plasmodium falciparum, 
Plasmodium vivax, Plasmodium malariae, other Plasmodium species,, Trypanosoma 
brucei, Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica, 

14. The vector of claim 5, wherein the peptide encoding nucleic acid encodes a cancer 
antigen. 

15. The vector of claim 14, wherein the cancer antigen is an antigen from a cancer 
selected from the group consisting of lymphomas (Hodgkins and non-Hodgkins), B cell 
lymphoma, T cell lymphoma, leukemias, myeloid leukemia, carcinomas, carcinomas of 
solid tissues, squamous cell carcinomas of mouth, throat, larynx, lung, head and neck, 
adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, neuroblastomas, 
plasmacytomas, histiocytomas, melanomas, adenomas, hypoxic tumours, myelomas, 
AIDS-related lymphomas or sarcomas, metastatic cancers, mycosis frangoides, bladder 
cancer, brain cancer, nervous system cancer, head and neck cancer, kidney cancer, lung 
cancers such as small cell lung cancer and non-small cell lung cancer, 
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neuroblastoma/glioblastoma, ovarian cancer, liver cancer, colon cancer, cervical 
cancer, cervical carcinoma, breast cancer, and epithelial cancer, renal cancer, 
genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck 
carcinoma, large bovs^el cancer, hematopoietic cancers; testicular cancer; colon and 
rectal cancers, prostatic cancer, or pancreatic cancer. 

16. The cancer antigen of claim 15, wherein the antigen is selected from the list of 
cancer antigen consisting of mucin (Muc-1; a 20 amino acid core repeat for Muc-1 
glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha-ras 
oncogene product, p53, carcino-cmbryonic antigen (CEA), the raf oncogene product, 
gpl00/pmell7, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, 
tyrosinase, gp75, Melan- A/Mart- 1 , gplOO, HER2/neu, EBV-LMP 1 & 2, HPV-F4, 6, 7, 
prostate-specific antigen (PSA), HPV-16, MUM, alpha-fetoprotein (AFP), CO 17- 1 A, 
GA733, gp72, p53, the ras oncogene product, HPV E7, Wilm's tumor antigen-1, 
telomerase, and melanoma gangliosides. 

17. A cell comprising the vector of claim 1. 

18. A method of detecting the expression of the vector of claim 1, comprising 
obtaining a sample containing the vector, contacting the sample with an antibody 
directed to antigen encoded by the irnmunizing construct, and detecting antigen bound 
to the antibody, the presence of bound antigen indicating expression of the vector. 

19. The method of claim 18, wherein the sample is blood from a subject. 

20. The method of claim 18, wherein the sample is tissue biopsy from a subject. 

21 . A method of treating. a . subject with a condition comprising administering to the 
subject the vector of claim 1 . 

22. The method of claim 21, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Herpes simplex vi;Tis t>pe-l. Herpes simplex virus type-2. 
Cytomegalovirus, Ep^teinrBan- virus. Varicella-zoster virus. Human herpesvirus 6, 
Human heipesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A virus, Hepatitis B virus. Hepatitis C virus. Hepatitis D yii'us. Hepatitis E 
virus, Rhinovirus, Coronavims, Influenza virus A, Influenza vims B, Measles virus, 
Polyomavinis, Human Papilomavirus, Respiratory syncytial vims. Adenovirus, 
Coxsackie vims. Dengue vims. Mumps vims, Poliovirus, Rabies vims, Rous sarcoma 
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virus, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus, Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 
Murray Valley fever virus, West Nile virus. Rift Valley fever virus. Rotavirus A, 
Rotavirus B, Rotavirus C, Sindbis virus. Simian Immunodeficiency cirus, Human 
cell Leukemia virus type-1, Hantavirus, Rubella virus, Simian Immunodeficiency virus, 
Hunian Immunodeficiency virus type- 1 , and Human Immunodeficiency virus type-2. 

23. The method of claim 2 1 , wherein the condition is associated with bacterial 
infection, and the immunizing construct encodes an antigen from a bacterium selected 
from the group sonsisting of M tuberculosis, M, bovis, M, bovis strain BCG, BCG 
substrains, M avium, M, intracellular^ M. africanum, M kansasii, M marinum, M 
ulcerans, M avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila, other Legionella species. Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 
Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittacU Coxiella bumetti, oXh^v Rickettsial species, Ehrlichia species. Staphylococcus 
aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 
agalactiae, Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia 
enterolitica, and other Yersinia species, 

24. The method of claim 21, wherein the condition is associated with parasitic 
infection, and the immunizing construct encodes an antigen from a parasite selected 
from the group consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 

25. The method of claim 21, wherein the condition is cancer, and the immunizing 
construct encodes a tumor antigen from the list consisting of human epithelial cell 
mucin (Muc-1; a 20 amino acid core repeat for Muc-1 glycoprotein, present on breast 
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cancer cells and pancreatic cancer cells), the Ha-ras oncogene product, p53, carcino- 
embryonic antigen (CEA), the raf oncogene product, gpl00/pmell7, GD2, GD3, GM2, 
TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, tyrosinase, gp75, Melan-A/Mart-1, 
gplOO, HER2/neu, EBV-LMP 1 & 2, HPV-F4, 6, 7, prostate-specific antigen (PSA), 
HPV-16, MUM, alpha-fetoprotein (AFP), C017-1 A, GA733, gp72, p53, the ras 
oncogene product, HPV E7, Wilm's tumor antigen- 1, telomerase, and melanoma 
gangliosides. 

26. A method of preventing a condition in a subject comprising administering to the 
subject the vector of claim 1 . 

27. The method of claim 26, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Herpes simplex virus type-1, Herpes simplex virus type-2. 
Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus, Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus. Hepatitis C virus. Hepatitis D virus, Hepatitis E 
virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus. Dengue virus, Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 
virus, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis virus, Japanese Encephalitis vims, St. Louis Encephalitis virus, 
Murray Valley fever virus. West Nile virus, Rift Valley fever virus. Rotavirus A, 
Rotavirus B, Rotavirus C, Sindbis virus, Simian Immunodeficiency cirus, Human T- 
cell Leukemia virus type-1. Hantavirus, Rubella virus, Simian Immunodeficiency virus. 
Human Immunodeficiency vims type-1, and Human Immunodeficiency virus type-2. 

28. The method of claim 26, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of M tuberculosis, M bovis, M, bovis strain BCG, BCG substrains, A/. 
avium, M intracellulare, M, africanum, M kansasii. M marinum, M. ulcerans, M 
avium suhspccits paraiuberculosis, Nocardia asteroides, other Nocardia species, 
Legionella pneumophila, oXh^x Legionella species. Salmonella typhi, other Salmonella 
species. Shigella species, Yersinia pestis, Pasteurella haemolytica, Pasteurella 
multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. Listeria 
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monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, Cowdria 
ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia psittaci, 
Coxiella bumetti, other Rickettsial species, Ehrlichia species, Staphylococcus aureus. 
Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus agalactiae. 
Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter species, Neiserria 
meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other Pseudomonas 
species, Haemophilus influenzae, Haemophilus ducreyi, other Hemophilus species, 
Clostridium tetani, other Clostridium species, yersinia enterolitica, and other Yersinia 
species, 

29. The method of claim 28, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, 
Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, Trypanosoma 
cruzi, Leishmania major, other Leishmania species.. Schistosoma mansoni, other 
Schistosoma species., and Entamoeba histolytica. 

30. A method of making a model for use in screening for substances effective in 
treating or preventing a disease associated with an immunizing construct, comprising 
administering the vector of claim 1 to an animal capable of manifesting a characteristic 
of the immunizing construct. 

31. A model made in accordance with the method of claim 30. 

32. A method of screening for a substance effective in treating a disease associated 
with an immunizing construct comprising: a) administering the substance to the model 
of claim 31; and b) assaying for an change in the course of the disease as compared to 
an the course of the disease in a control subject; an improvement in the course of the 
disease in the presence of the substance identifies a substance that is effective in 
treating the disease. 

33. A method of screening for an immunizing construct effective in preventing a 
disease associated with an immunizing construct comprising: a) administering the 
vector of claim 1 to a subject; b) subjecting the subject to treatment that will induce the 
disease or characteristic of the disease; and c) assaying for an change in the course of 
the disease as compared to an the course of the disease in a control subject; an 
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improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in preventing the disease. 

34. A method of screening for an immunizing construct effective in treating a disease 
effective in treating a disease associated with an immunizing construct comprising: a) 
subjecting the subject to treatment that will induce the disease or characteristic of the 
disease; b) administering the vector of claim 1 to a subject; and c) assaying for an 
change in the course of the disease as compared to an the course of the disease in a 
control subject; an improvement in the course of the disease in the presence of the 
substance identifies a substance that is effective in treating the disease. 

35. The method of claims 21-34, wherein the subject is a cell. 

36. The method of claims 21-34, wherein the subject is a mouse. 

37. The method of claims 21-34, wherein the subject is a monkey. 

38. The method of claims 21-34, wherein the subject is a human 



59 



wo 2004/0031S3 ^PCT/US2003/020325 

1/8 




wo 2004/003153 



2/8 



CT/US2003/020325 



JHa 1(14391) 
LTR 
4;9n<14032> 




M« I (13577) 
//0M(13»I)| 

Mo I (12131) 
lfe»RV(12S26)[ 

AmUi 
H/i 11(12783) 
Ssl 0(12778)11 
Ateo 1(12548) 
Cla I(t2S19)l 
^«rn(l2433)\\ 

^1(12291) " 
Ai 1(11160} 
taH 1(12013) 
&«R1(J188S) 
Aat n(lU79i 
Mh»I(1163«)\\ 
fVKD(lll02)Vi 
Jnwl(10855)^^ 
Ah1(I 

JBtoRV (10039^^ 
S£f|<9899) 
5dl(966t) 
«elI(95S6) 
i!r/N» 1(9477) 
Wvrn(«394)^ 
^091(9348). 
I (9S29) 
fiooRV (9068) 
^11(8806) 
^cel(879'7) 



1(2213) 

// M^mm 

LTR 

,nkiic>i7i) 

/flvrO(3SO) 
S;mRV(388«) 
.A^ID(41«1) 
JCbo 1(4440) 
£18ID(49«0) 

vMb9l(4775) 
A V^iJc* 1(4826) 
\\5*i»I(J032) 

I \\\ ^'<""> 
r^^iTn(J355) 

\l\ ^PM.0(S41S) 

m INv 0(3338) 

UjifA»I(5801) 

n //pal (5822) 



\Aoc\ifisa4} 

W/Ml(6095) 
/V» 0(6208) 
A>al(«»29) 
IMi 0(6412) 
'mx»I(7073) 



Fig. 2 



wo 2004/003153 




•CT/US2003/020325 



3/8 




wo 2004/003153 ^ ^CT/OS2003/020325 

4/8 




H 
PQ 



Ph 

O ' CO ^ 

I ^ =tt: ^ =tt ^ 

^ a> D O) W 0) 

CO a a c c c 

ffi ^ ^ ^ ^ ^ 



^ (N m rf- u-^ vo 



BEST AVAILABLE COPY 



wo 2004/003153 ^lcT/US2003/020325 

5/8 




I 1 1 1 1 1 1 1 1 1 

00>ODt^lDVO"*fOCNi-0 




b£ST AVAILABLE COPY 



wo 2004/003153 ^ WCT/US2003/020325 

7/8 




BEST AVAILABLE COPY 



wo 2004/003153 ^PcT/US2003/020325 

8/8 



3 0UOI0 I 
ASHd 




I i i 



i 



i t 



I I 



Z 9UO\0 I 

ASHd 



I I i 



I I I 



ri 



oo 



)|90UI 



ASHd Tjl;^ 




ft 



BEST AVAILABLE COPY 



wo 2004/003153 



^^CT/US2003/020325 

SEQ. ID NO. I 



1 CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT 
GTCCACCGTG AAAAGCCCCT TTACACGCGC CTTGGGGATA AACAAATAAA 



51 


TTCTAAATAC 
AAGATTTATG 


ATTCAAATAT 
TAAGTTTATA 


GTATCCGCTC 
CATAGGCGAG 


ATGAGACAAT 
TACTCTGTTA 


AACCCTGATA 
TTGGGACTAT 


101 


AATGCTTCAA 
TTACGAAGTT 


TAATATTGAA 
ATTATAACTT 


AAAGGAAGAG 
TTTCCTTCTC 


TATGAGTATT 
ATACTCATAA 


CAACATTTCC 
GTTGTAAAGG 


151 


GTGTCGCCCT 
CACAGCGGGA 


TATTCCCTTT 
ATAAGGGAAA 


TTTGCGGCAT 
AAACGCCGTA 


TTTGCCTTCC 
AAACGGAAGG 


TGTTTTTGCT 
ACAAAAACGA 


201 


CACCCAGAAA 
GTGGGTCTTT 


CGCTGGTGAA 
GCGACCACTT 


AGTAAAAGAT 
TCATTTTCTA 


GCTGAAGATC 
CGACTTCTAG 


AGTTGGGTGC 
TCAACCCACG 


251 


ACGAGTGGGT 
TGCTCACCCA 


TACATCGAAC 
ATGTAGCTTG 


TGGATCTCAA 
ACCTAGAGTT 


CAGCGGTAAG 
GTCGCCATTC 


ATCCTTGAGA 
TAGGAACTCT 


301 


GTTTTCGCCC 
CAAAAGCGGG 


CGAAGAACGT 
GCTTCTTGCA 


TTTCCAATGA 
AAAGGTTACT 


TGAGCACTTT 
ACTCGTGAAA 


TAAAGTTCTG 
ATTTCAAGAC 


351 


CTATGTGGCG 
GATACACCGC 


CGGTATTATC 
GCCATAATAG 


CCGTGTTGAC 
GGCACAACTG 


GCCGGGCAAG 
CGGCCCGTTC 


AGCAACTCGG 
TCGTTGAGCC 


401 


TCGCCGCATA 
AGCGGCGTAT 


CACTATTCTC 
GTGATAAGAG 


AGAATGACTT 
TCTTACTGAA 


GGTTGAGTAC 
CCAACTCATG 


TCACCAGTCA 
AGTGGTCAGT 


451 


CAGAAAAGCA 
GTCTTTTCGT 


TCTTACGGAT 
AGAATGCCTA 


GGCATGACAG 
CCGTACTGTC 


TAAGAGAATT 
ATTCTCTTAA 


ATGCAGTGCT 
TACGTCACGA 


501 


GCCATAACCA 
CGGTATTGGT 


TGAGTGATAA 
ACTCACTATT 


CACTGCGGCC 
GTGACGCCGG 


AACXTACTTC 
TTGAATGAAG 


TGACAACGAT 
ACTGTTGCTA 


551 


CGGAGGACCG 
GCCTCCTGGC 


AAGGAGCTAA 
TTCCTCGATT 


CCGCTTTTTT 
GGCGAAAAAA 


GCACAACATG 
CGTGTTGTAC 


GGGGATCATG 
CCCCTAGTAC 


601 


TAACTCGCCT 
ATTGAGCGGA 


TGATCGTTGG 
ACTAGCAACC 


GAACCGGAGC 
CTTGGCCTCG 


TGAATGAAGC 
ACTTACTTCG 


CATACCAAAC 
GTATGGTTTG 


651 


GACGAGCGTG 
CTGCTCGCAC 


ACACCACGAT 
TGTGGTGCTA 


GCCTGCAGCA 
CGGACGTCGT 


ATGGCAACAA 
TACCGTTGTT 


CGTTGCGCAA 
GCAACGCGTT 


701 


ACTATTJ'wACT 
TGATAATTGA 


GGCGAACTAC 
CCGCTTGATG 


TTACTCTAGC 
AATGAGATCG 


TTCCCGGCAA 
AAGGGCCGTT 


CAATTAATAG 
GTTAATTATC 


751 


ACTGGATGGA 
TGACCTACCT 


GGCGGATAAA 
CCGCCTATTT 


GTTGCAGGAC 
CAACGTCCTG 


CACTTCTGCG 
GTGAAGACGC 


CTCGGCCCTT 
GAGCCGGGAA 


801 


CCGGCTGGCT 
GGCCGACCGA 


GGTTTATTGC 
CCAAATAACG 


TGATAAATCT 
ACTATTTAGA 


GGAGCCGGTG 
CCTCGGCCAC 


AGCGTGGGTC 
TCGCACCCAG 


851 


TCGCGGTATC 
AGCGCCATAG 


ATTGCAGCAC 
TAACGTCGTG 


TGGGGCCAGA 
ACCCCGGTCT 


TCCATTCGGG 
AGGTAAGCCC 


AGGGCATAGC 
TCCCGTATCG 


901 


ATCAATAGAT 
TAGTTATCTA 


GTGCTGCCCC 
CACGACGGGG 


TCAGTCCGTT 
AGTCAGGCAA 


GATACCTA.CT 
CTATGGATGA 


TGGTAAGCCC 
ACCATTCGGG 
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351 


TCCCGTATCG 
AGGGCATAGC 


TAGTTATCTA 
ATCAATAGAT 


CACGACGGGG 
GTGCTGCCCC 


AGTCAGGCAA 
TCAGTCCGTT 


* CTATGGATGA 
GATACCTACT 


1001 


ACGAAATAGA 
TGCTTTATCT 


CAGATCGCTG 
GTCTAGCGAC 


AGATAGGTGC 
TCTATCCACG 


CTCACTGATT 
GAGTGACTAA 


AAGCATTGGT 
TTCGTAACCA 


1051 


AACTGTCAGA 
TTGACAGTCT 


CCAAGTTTAC 
GGTTCAAATG 


TCATATATAC 
AGTATATATG 


TTTAGATTGA 
AAATCTAACT 


TTTAAAACTT 
AAATTTTGAA 


1101 


CATTTTTAAT 
GTAAAAATTA 


TTAAAAGGAT 
AATTTTCCTA 


CTAGGTGAAG 
GATCCACTTC 


ATCCTTTTTG 
TAGGAAAAAC 


ATAATCTCAT 
TATTAGAGTA 


11 5x 


GACCAAAATC 
CTGGTTTTAG 


CCTTAACGTG 
GGAATTGCAC 


AGTTTTCGTT 
TCAAAAGCAA 


CCACTGAGCG 
GGTGACTCGC 


TCAGACCCCG 
AGTCTGGGGC 


1201 


TAGAAAAGAT 
ATCTTTTCTA 


CAAAGGATCT 
GTTTCCTAGA 


TCTTGAGATC 
AGAACTCTAG 


CTTTTTTTCT 
GAAAAAAAGA 


GCGCGTAATC 
CGCGCATTAG 


12bl 


TGCTGCTTGC 
ACGACGAACG 


AAACAAAAAA 
TTTGTTTTTT 


ACCACCGCTA 
TGGTGGCGAT 


CCAGCGGTGG 
GGTCGCCACC 


TTTGTTTGCC 
AAACAAACGG 


13 0.1 


GGATCAAGAG 
CCTAGTTCTC 


CTACCAACTC 
GATGGTTGAG 


TTTTTCCGAA 
AAAAAGGCTT 


GGTAACTGGC 
CCATTGACCG 


TTCAGCAGAG 
AAGTCGTCTC 


1351 


. CGCAGATACC 
GCGTCTATGG 


AAATACTGTC 
TTTATGACAG 


CTTCTAGTGT 
GAAGATCACA 


AGCCGTAGTT 
TCGGCATCAA 


AGGCCACCAC 
TCCGGTGGTG 


1401 


TTCAAGAACT 
AAGTTCTTGA 


CTGTAGCACC 
GACATCGTGG 


GCCTACATAC 
CGGATGTATG 


CTCGCTCTGC 
GAGCGAGACG 


TAATCCTGTT 
ATTAGGACAA 


1451 


ACCAGTGGCT 
TGGTCACCGA 


GCTGCCAGIG 
CGACGGTCAC 


GCGATAAGTC 
CGCTATTCAG 


GTGTCTTACC 
CACAGAATGG 


GGGTTGGACT 
CCCAACCTGA 


1501 


C-AAGACGATA 
GTTCTGCTAT 


GTTACCGGAT 
CAATGGCCTA 


AAGGCGCAGC 
TTCCGCGTCG 


GGTCGGGCTG 
CCAGCCCGAC 


AACGGGGGGT 
TTGCCCCCCA 


1551 


TCGTGCACAC 
AGCACGTGTG 


AGCCCAGCTT 
TCGGGTCGAA 


GGAGCGAACG 
CCTCGCTTGC 


ACCTACACCG 
TGGATGTGGC 


AACTGAGATA 
TTGACTCTAT 


1601 


CCTACAGCGT 
GGATGTCGCA 


6AGCTATGAG 
CTCGATACTC 


AAAGCGCCAC 
TTTCGCGGTG 


GCTTCCCGAA 
CGAAGGGCTT 


GGGAGAAAGG 
CCCTCTTTCC 


1651 


CGGACAGGTA 
GCCTGTCCAT 


TCCGGTAAGC 
AGGCCATTCG 


GGCAGGGTCG 
CCGTCCCAGC 


GAACAGGAGA 
CTTGTCCTCT 


GCGCACGAGG 
CGCGTGCTCC 


1701 


GAGCTTCCAG 
CTCGAAGGTC 


GGGGAAACGC 
CCCCTTTGCG 


CTGGTATCTT 
GACCATAGAA 


TATAGTCCTG 
ATATCAGGAC 


TCGGGTTTCG 
AGCCCAAAGC 


1751 


CCACCTCTGA 
GGTGGAGACT 


CTTGAGCGTC 
GAACTCGCAG 


GATTTTTGTG 
CTTJlAAAACAC 


ATGCTCGTCA 
TACGAGCAGT 


GGGGGGCGGA 
CCCCCCGCCT 


1801 


GCCTATGGAA 
CGGATACCTT 


AAACGCCAGC 
TTTGCGGTCG 


AACGCGGCCT 
TTGCGCCGGA 


TTTTACGGTT 
AAAATGCCAA 


CCTGGCCTTT 
GGACCGGAAA 


1851 


TGCTGGCCTT 
ACGACCGGAA 


TTGCTCACAT 
AACGAGTGTA 


GTTCTTTCCT 
CAAGAAAGGA 


GCGTTATCCC 
CGCAATAGGG 


CTGATTCTGT 
GACTAAGACA 
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28 51 CAATCAACAA AACAATGATG ATGTAATCAT AAGGAAGTAG TTTAAAATAG 
GTTAGTTGTT TTGTTACTAC TACATTAGTA TTCCTTCATC AAATTTTATC 



•3 O A * 
^ 7 O X 


rAATAAGT 
CAATTATTCA 


TTATTAGTTA 
AATAATCAAT 


TATAGAAAAT 
ATATCTTTTA 


AATATAGGAT 
TTATATCCTA 


AAAGTATAAG 
TTTCATATTC 


^ Ct e 1 


GATTAAGGTA 
CTAATTCCAT 


TGAGGTGTGT 
ACTCCACACA 


GGCTCAACAC 
CCGAGTTGTG 


GTAGGGTGAC 
CATCCCACTG 


AAGAAAATCT 
TTCTTTTAGA 


"i n rt 1 
J U U J. 


Al_ ilj i AATAG 
TGACATTATC 


GACACAACAC 
CTGTGTTGTG 


CTCTAAAGTT 
GAGATTTCAA 


GCCCGTGGGA 
CGGGCACCCT 


AGGTGAAGTG 
TCCACTTCAC 


J U 3 X 


AsjAI UiisAATv. 
TCTAGCTTAG 


TTTCCTTAAC 
AAAGGAATTG 


GCAGACAGCT 
CGTCTGTCGA 


TTTTATCCAC 
AAAATAGGTG 


TAGGGATAAT 
ATCCCTATTA 


J X U X 


i i XJLAACjGA 
CAAAATTCCT 


ATACTATAGT 
TATGATATCA 


AATAGATTGA 
TTATCTAACT 


TAGTTTTAAC 
ATCAAAATTG 


AATGATGGAA 
TTACTACCTT 


J X 3 X 


ATAGTATATA 
TAT CAT AT AT 


AGGATAGTTT 
TCCTATCAAA 


CTAGATTGTA 
GATCTAACAT 


CGGGAGCTCT 
GCCCTCGAGA 


TCACTACTCG 
AGTGATGAGC 


J ^ U X 


v» i (^(.GTuGAG 

6ACGCAGCTC 


AGTGTACGAG 
TCACATGCTC 


ACTCTCCAGG 
TGAGAGGTCC 


TTTGGTAAGA 
AAACCATTCT 


AATATTTTAT 
TTATAAAATA 


J ^ 3 X 


A i L\3 L X. AX A A 

TAACAATATT 


TGTTACTATG 
ACAATGATAC 


ATCCATTAAC 
TAGGTAATTG 


ACTCTGCTTA 
TGAGACGAAT 


TAGATTGTAA 
ATCTAACATT 


.3 U X 


uVffNj X V7A 1 TviC 

CCCACTAACG 


AATGCTTTCT 
TTACGAAAGA 


GCATAAAACT 
CGTATTTTGA 


TTGGTTTTCT 
AACCAAAAGA 


TGTTAATCAA 
ACAATTAGTT 


J ^ 9 X 


1 rtAAC CG ACT 
ATTTGGCTGA 


TGATTCGAGA 
ACTAAGCTCT 


ACCTACTCAT 
TGGATGAGTA 


ATATTATTGT 
TATAATAACA 


CTCTTTTATA 
GAGAAAATAT 


J 4 U X 


CT i. T ATT AAG 
GAAATAATTC 


TAAAAGGATT 
ATTTTCCTAA 


TGTATATTAG 
ACATATAATC 


CCTTGCTAAG 
GGAACGATTC 


GGAGACATCT 
CCTCTGTAGA 


J 4 3 X 


a ^ 'P ^ A A t* A A 

Ava X V3 A 1 A r AA 
TCACTATATT 


GTGTGAACTA 
CACACTTGAT 


CACTTATCTT 
GTGAATAGAA 


AAATGATGTA 
TTTACTACAT 


ACTCCTTAGG 
TGAGGAATCC 


^ 3 V X 


AT AA - C AA i A 
TATTAGTTAT 


TACAAAATTC 
AT3TTTTAAG 


CATGACAATT 
GTACTGTTAA 


GGCGCCCAAC 
CCGCGGGTTG 


GTGGGGCTCG 
CACCCCGAGC 


3 5 51 


AATATAAGTC 
TTATATTCAG 


GGGTTTATTT 
CCCAAATAAA 


GTAAATTATC 
CATTTAATAG 


CCTAGGGACC 
GGATCCCTGG 


TCCGAGCATA 
AGGCTCGTAT 


3601 


GCGGGAGGCA 
CGCCCTCCGT 


TAT A A A. AG CC 
ATATTTTCGG 


TTATCTGTTA 


VjGCTTCAGGA 
CCGAAGTCCT 


AGTAATGTTG 
TCATTACAAC 


3651 


AAGAATATGA 
TTCTTATACT 


ACTTGATGTT 
TGAACTACAA 


GAAGCTCTGG 
CTTCGAGACC 


TTGTAATTTT 
AACATTAAAA 


AAGAGATAGA 
TTCTCTATCT 


3701 


AATATACCAA 
TTATATGGTT 


GAAATCCTTT 
CTTTAGGAAA 


ACATGGApAA 
TGTACCTCTT 


GTTATAGGTC 
CAATATCCAG 


TTCGCCTTAC 
AAGCGGAATG 


3751 


TGAAGGATGG 
ACTTCCTACC 


TGGGGACAAA 
ACCCCTGTTT 


TTGAGAGATT 
AACTCTCTAA 


TCAGATGGTA 
AGTCTACCAT 


CGTCTAATAT 
GCAGATTATA 
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3801 


TACAAGATGA 
ATGTTCTACT 


TGATAATGAA 
ACTATTACTT 


CCTTTACAGA 
GGAAATGTCT 


GACCTAGATA 
CTGGATCTAT 


TGAGGTAATA 
ACTCCATTAT 


3851 


CAACGAGCTG 
GTTGCTCGAC 


TAAACCCTCA 
ATTTGGGAGT 


TACAATGTTT 
ATGTTACAAA 


ATGATATCAG 
TACTATAGTC 


GACCATTAGC" 
CTGGTAATCG 


3 901 


TGAACTTCAA 
ACTTGAAGTT 


TTAGCCTTTC 
AATCGGAAAG 


AGGATTTAGA 
TCCTAAATCT 


TTTACCTGAA 
AAATGGACTT 


GGTCCATTGA 
CCAGGTAACT 


3 951 


GGTTTGGTCC 
CCAAACCAGG 


ATTGGCAAAT 
TAACCGTTTA 


GGACATTATG 
CCTGTAATAC 


TTCAAGGAGA 
AAGTTCCTCT 


TCCTTATAGT 
AGGAATATCA 


4001 


AGTTCTTACA 
TCAAGAATGT 


GACCAGTAAC 
CTGGTCATTG 


AATGGCCGAA 
TTACCGGCTT 


ACAGCCCAAA 
TGTCGGGTTT 


TGACTAGAGA 
ACTGATCTCT 


4051 


TGAACTGGAA 
ACTTGACCTT 


GATGTTCTTA 
CTACAAGAAT 


ATACTCAAAG 
TATGAGTTTC 


TGAAATAGAA 
ACTTTATCTT 


ATTCAAATGA 
TAAGTTTACT 


4101 


TAAATTTATT 
ATTTAAATAA 


GGAGTTGTAT 
CCTCAACATA 


GAAGTTGAAA 
CTTCAACTTT 


CTAGAGCTCT 
GATCTCGAGA 


TAG AAGACAA 
ATCTTCTGTT 


4151 


TTAGCTGAGA 
AATCGACTCT 


GATCTAGTAC 
CTAGATCATG 


AGGGCAAGGA 
TCCCGTTCCT 


GGAATATCCC 
CCTTATAGGG 


CAGGAGCTCC 
GTCCTCGAGG' 


4201 


TCGTTCTCGA 
AGCAAGAGCT 


CCACCAGTAA 
GGTGGTCATT 


GCAGCTTCTC 
CGTCGAAGAG 


AGGGTTACCA 
TCCCAATGGT 


AGTTTGCCCT 
TCAAACGGGA 


4251 


CTATACCTGG 
GATATGGACC 


GATTCATCCC 
CTAAGTAGGG 


AGGGCACCTT 
TCCCGTGGAA 


CACCTCCAAG 
GTGGAGGTTC 


GGCAACTTCT 
CCGTTGAAGA 


4301 


ACVCCCGGAA 
TGAGGGCCTT 


r^TATTCCTTG 
TATAAGGAAC 


GAGTTTAGGA 
CTCAAATCCT 


GATGATAGCC 
CTACTATCGG 


CACCTTCATC 
GTGGAAGTAG 


4351 


TAGTTTTCCT 
ATCAAAAGGA 


GGACCCTCTC 
CCTGGGAGA5 


AACCTCGTGT 
TTGGAGCACA 


TTCTTTCCAT 
AAGAAAGGTA 


CCGGGAAATC 
GGCCCTTTAG 


4401 


CTTTTGTTGA 
GAAAACAACT 


AGAAGAAGGT 
TCTTCTTCCA 


CATAGACCTA 
GTATCTGGAT 


GATCCCAGTC 
CTAGGGTCAG 


TAGAGAAAGG 
ATCTCTTTCC 


4451 


AGAAGAGAAA 
TCTTCTCTTT 


TTCTTCCTGC 
AAGAAGGACG 


TCCTGTACCG 
AGGACATGGC 


TCAGCACCTC 
AGTCGTGGAG 


CTATGATTCA 
GATACTAAGT 


4501 


GTATATACCA 
CATATATGGT 


GTACCACCTC 
CATGGTGGAG 


CACCACCGAT 
GTGGTGGCTA 


TGGCACGGTT 
ACCGTGCCAA 


ATACCTATTC 
TATGGATAAG 


4551 


AGCATATCAG 
TCGTATAGTC 


ATCTGTAACT 
TAGACATTGA 


GGAGAGCCTC 
CCTCTCGGAG 


CTAGAAACCC 
GATCTTTGGG 


AAGAGAAATA 
TTCTCTTTAT 


4601 


CCAATTTGGC 
GGTTAAACCG 


TAGGACGAAA 
ATCCTGCTTT 


TGCTCCTGCT 
ACGAGGACGA 


ATAGATGGAG 
TATCTACCTC 


TGTTCCCTGT 
ACAAGGGACA 


4651 


TACAACACCG 
ATGTTGTGGC 


GATCTAAGAT 
CTAGATTCTA 


GCAGAATAAT 
CGTCTTATTA 


TAATGCTATA 
ATTACGATAT 


CTAGGAGGAA 
GATCCTCCTT 


4701 


ATATTGGGCT 
TATAACCCGA 


ATCATTAACC 
TAGTAATTGG 


CCTGGAGACT 
GGACCTCTGA 


GTTTAACATG 
CAAATTGTAC 


GGACTCAGCA 
CCTGAGTCGT 
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4751 


GTAGCCACCT 
CATCGGTGGA 


TATTTATTAG 
ATAAATAATC 


AACCCATGGA 
TTGGGTACCT 


ACTTTTCCAA 
TGAAAAGGTT 


TGCATCAGCT 
ACGTAGTCGA 


4801 


TGGAAATGTA 
ACCTTTACAT 


ATAAAAGGCA 
TATTTTCCGT 


TAGTTGATCA 
ATCAACTAGT 


AGAAGGAGTG 
TCTTCCTCAC 


GCAACAGCAT 
CGTTGTCGTA 


4851 


ATACTTTGGG 
TATGAAACCC 


AATGATGCTT 
TTACTACGAA 


TCTGGACAAA 
AGACCTGTTT 


ATTATCAATT 
TAATAGTTAA 


AGTTTCTGGA 
TCAAAGACCT 


4901 


ATAATTAGAG 
TATTTHATCTC 


GATATTTGCC 
CTATAAACGG 


TGGACAAGCT 
ACCTGTTCGA 


GTAGTAACTG 
CATCATTGAC 


CATTACAACA 
GTAATGTTGT 


4951 


GCGTTTAGAC 
CGCAAATCTG 


CAAGAAATAG 
GTTCTTTATC 


ATAATCAAAC 
TATTAGTTTG 


AAGAGCAGAG 
TTCTCGTCTC 


ACTTTTATTC 
TGAAAATAAG 


5001 


AACATCTAAA 
TTGTAGATTT 


TGCTGTATAT 
ACGACATATA 


GAAATTTTAG 
CTTTAAAATC 


GCCTTAATGC 
CGGAATTACG 


CAGAGGACAA 
GTCTCCTGTT 


5051 


AGTATACGTG 
TCATATGCAC 


CTTCAGTGAC 
GAAGTCACTG 


TCCTCAACCC 
AGGAGTTGGG 


C6ACCATCCA 
GCTGGTAGGT 


GAGGTAGAGG 
CTCCATCTCC 


5101 


TCGAGGTCAA 
AGCTCCAGTT 


AATACTTCTA 
TTATGAAGAT 


GACCCTCTCA 
CTGGGAGAGT 


AGGACCAGCT 
TCCTGGTCGA 


AATAGCGGGC 
TTAT.CGCCCf? 


5151 


GGGGACGACA 
CCCCTGCTGT 


GCGCCCTGCT 
CGCGGGACGA 


TCTGGTCAAA 
AGACCAGTTT 


GCAACAGAGG 
CGTTGTCTCC 


ATCTAGTACT 
TAGATCATGA 


5201 


CAGAATCAAA 
GTCTTAGTTT 


ATCAAGATAA 
TAGTTCTATT 


TTTAAATCAA 
AAATTTAGTT 


GGAGGATATA 
CCTCCTATAT 


ATCTTCGACC 
TAGAAGCTGG 


3251 


CCGTACTTAC 
GGCATGAATG 


CAACCTCAAA 
GTTGGAGTTT 


GGTACGGAGG 
CCATGCCTCC 


AGGAC3TGGA 
TCCTGCACCT 


CGAAGATGr-xA 
GCTTCTACCT 


5301 


ACGATAATAC 
TGCTATTATG 


TAACAATCAA 
ATTGTTAGTT 


GAGTCCAGAC 
CTCAGGTCTG 


CATCAGATCA 
GTAGTCTAGT 


AGGT7CTCAA 
TCCAAGAGT7 


53 51 


ACTCCTAGGC 
TGAGGATCCG 


CAAATCAAGC 
GTTTAGTTCG 


AGGCTCTGGG 
TCCGAGACCC 


GTGCGTGGCA 
CACGCACCGT 


ATCAGTCACA 
TAGTCAGTGT 


5401 


AACTCCCAGA 
TTGAGGGTCT 


CCAGCTGCTG 
GGTCGACGAC 


GTCGCGGAGG 
CAGCGCCTCC 


AAGAGGTAAC 
TTCTCCATTG 


CACAACCGAA 
GTGTTGGCTT 


54 51 


ACCAACGATC 
TGGTTGCTAG 


ATCCGGTGCT 
TAGGCCACGA 


GGTGACTCAC 
CCACTGAGTG 


GCGCTGTCAA 
CGCGACAGTT 


TACCGTGACA 
ATGGCACTGT 


5501 


CAGAGTGCCA 
GTCTCACGGT 


CGTCCTCCAC 
GCAGGAGGTG 


AGATGAATCC 
TCTACTTAGG 


TCTTCAGCTG 
AGAAGTCGAC 


TTACAGCCGC 
AATGTCGGCG 


5551 


TTCCGGCGGA 
AAGGCCGCCT 


GATCAAAGGG 
CTAGTTTCCC 


ACTAAATTGT 
TGATTTAACA 


TAGCCCACTG 
ATCGGGTGAC 


GGATTCAGGG 
CCTAAGTCCC 


5601 


GCAACAATAA 
CGTTGTTATT 


CTTGTATTCC 
GAACATAAGG 


TGAAAGTTTT 
ACTTTCAAAA 


TTAGAAGATG 
AATCTTCTAC 


AACAACCTAT 
TTGTTGGATA 


5651 


TAAAAAGACT 
ATTTTTCTGA 


TTAATAAAAA 
AATTATTTTT 


CAATTCATGG 
GTTAAGTACC 


AGAAAAACAA 
TCTTTTTGTT 


CAAAATGTTT 
GTTTTACAAA 
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5 7 01 ATTATGTAAC CTTTAAAGTT AAAGGAAGAA AAGTGGAAGC AGAAGTGATA 
TAATACATTG GAAATTTCAA TTTCCTTCTT TTCACCTTCG TCTTCACTAT 

5 7 Si GCTTCTCCTT ATGAGTATAT TTTGCTGTCG CCAACAGATG TTCCTTGGTT 

CGAAGAGGAA TACTCATATA AAACGACAGC GGTTGTCTAC AAGGAACCAA 

58 01 AACACAGCAA CCACTTCAGT TAACAATTTT AGTTCCTCTT CAAGAATATC 

TTGTGTCGTT GGTGAAGTCA ATTGTTAAAA TCAAGGAGAA GTTCTTATAG 

5 9 51 AAGAGAAAAT CTTAAGTAAG ACTGCTCTTC CAGAAGATCA AAAACAACAA 
TTCTCTTTTA GAATTCATTC TGACGAGAAG GTCTTCTAGT TTTTGTTGTT 

59 01 TTAAAAACCT TGTTTGTCAA GTATGACAAT CTATGGCAAC ATTGGGAAAA 

AATTTTTGOA ACAAACAGTT CATACTGTTA GATACCGTTG TAACCCTTTT 

5 951 TCAAGTCGGG CATAGAAAAA TTAGGCCACA TAATATAGCA ACTGGTGATT 

AGTTCAGCCC GTATCTTTTT AATCCGGTGT ATTATATCGT TGACCACTAA 

6001 ATCCTCCTCG CCCTCAAAAA CAATATCCTA TTAATCCTAA GGCAAAGCCT 
TAGGAGGAGC GGGAGTTTTT GTTATAGGAT AATTAGGATT CCGTTTCGGA 

6 0 51 AGTATACAAA TTGTAATAGA TGACTTATTG AAACAAGGGG TGTTAACGCC 

TCATATGTTT AACATTATCT ACTGAATAAC TTTCTTCCCC ACAATTGCGG 

6101 TCAAAATAGT ACAATGAATA CACCAGTGTA TCCTGTTCCT AAACCAGATG 
AGTTTTATCA TGTTACTTAT 6TGGTCACAT AGGACAAGGA TTTGGTCTAC 

6151 GAAGGTGGAG AATGGTATTA GATTATAGAG AAGTAAATAA AACTATTCCA 
CTTCCACCTC TTACCATAAT CTAATATCTC TTCATTTATT TTGATAAG6T 

62 01 TTAACAGCTG CCCAAAACCA ACACTCTGCT GGTATTTTAG CTACTATTGT ' 
AATTGTCGAC GGGTTTTGGT TGTGAGACGA CCATAAAATC GATGATAACA 

62 31 TAGACAAAAA TATAAAACTA CCTTAGATTT AGCTAATGGA ■ TTTTGGGC^C 
ATCTCTTTTT ATATTTTGAT GGAATCTAAA TCGATTACCT. AAAACCCGAG 

63 01 ATCCTATTAC ACCAGAATCT TATTGGTTAA CAGCATTTAC CTGGCAAGGT 

TAGGATAATG TGGTCTTAGA ATAACCAATT GTCGTAAATG GACCGTTCCA 

63 51 AAACAGTATT GTTGGACACG TCTTCCTCAA GGATTTTTAA ATAGTCC^G" 

TTTGTCATAA CAACCTGTGC AGAAGGAGTT CCTAAAAATT TATCAGGTCG 

64 01 ATTGTTTACA GCTGATGTAG TAGATTTACT AAAAGAAATC CCTAATGTAC 

TAACAAATGT CGACTACATC ATCTAAATGA TTTTCTTTAG GGATTACATG 

TTclrlrlcl Ir^lt^l^'"' ^^^^AAGCC ATGATGATCC TAAAGAGCAT 
TTCACATACA ACTACTATAT ATAAATTCGG TACTACTAGG ATTTCTCGTA 

6501 GTTCAACAAT TAGAAAAAGT GTTTCAAATT TTACTACAGG CAGGATATGT 
CAAGTTGT.TA ATCTTTTTCA CAAAGTTTAA AATGATGTCC GTCCTATACA 

63 51 AGTATCTTTG AAAAAATCAG AAATTGG.TCA AAAAACTGTA GAATTTTTAG 
TCATAGAAAC TTTTTTAGTC TTTAACCAGT TTTTTGACAT CTTAAAAATC 

6601 GATTTAATAT TACTAAAGAA GGTCGTGGCC TAACAGACAC TTTTAAAACA 
CTAAATTATA ATGATTTCTT CCAGCACCGG ATTGTCTGTG AAAATTTTGT 
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6651 


AAACTGTTAA 
TTTGACAATT 


ATATTACTCC 
TATAATGAGG 


TCCAAAAGAC 
AGGTTTTCTG 


TTAAAGCAAT 
AATTTCGTTA 


TACAAAGCAT 
ATGTTTCGTA 


6701 


ATTAGGATTG 
TAATCCTAAC 


TTAAATTTTG 
AATTTAAAAC 


CTAGAAATTT 
GATCTTTAAA 


TATACCTAAT 
ATATGGATTA 


TTTGCTGAAC 
AAACGACTTG 


6751 


TGGTACAACC 
ACCATGTTGG 


ATTATACAAT 
TAATATGTTA 


TTAATAGCCT 
AATTATCGGA 


CAGCAAAAGG 
GTCGTTTTCC 


CAAATATATT 
GTTTATATAA 


6801 


GAGTGGTCTG 
CTCACCAGAC 


AAGAAAATAC 
TTCTTTTATG 


TAAACAATTA 
ATTTGTTAAT 


AATATGGTAA 
TTATACCATT 


TAGAAGCATT 
ATCTTCGTAA 


6851 


AAACACTGCC 
TTTGTGACGG 


TCTAATTTAG 
AGATTAAATC 


AAGAAAGGTT 
TTCTTTCCAA 


ACCAGAACAG 
TGGTCTTGTC 


AGACTGGTAA 
TCTGACCATT 


6901 


TTAAAGTCAA 
AATTTCAGTT 


TACTTCTCCA 
ATGAAGAGGT 


TCAGCAGGAT 
AGTCGTCCTA 


ATGTAAGATA 
TACATTCTAT 


TTATAATGAG 
AATATTACTC 


6951 


ACTGGTAAAA 
TGACCATTTT 


AGCCTATTAT 
TCGGATAATA- 


GTACCTAAAT 
CATGGATTTA 


TATGTGTTTT 
ATACACAAAA 


CCAAAGCAGA 
GGTTTCGTCT 


7001 


ATTAAAATTT 
TAATTTTAAA 


TCTATGTTAG 
AGATACAATC 


AAAAACTATT 
TTTTTGATAA 


AACTACAATG 
TTGATGTTAC 


CACAAAGCCT 
GTGTTTCGGA 


7051 


TAATTAAGGC 
ATTAATTCCG 


TATGG.AT^TG^CCATGGGAC 
ATAC'CTAAAC CGGTACCCTG 


AAGAAATATT 
TTCTTTATAA 


AGTTTATAGT 
TCAAATATCA 


7101 


CCCATTGTAT 
GGGTAACATA 


CTATGACTAA 
GATACTGATT 


AATACAAAAA 
TTAT GTTTTT 


ACTCCACTAC 
TGAGGTGATG 


CAGAAAGAAA 
GTCTTTCTTT 


7151 


AGCTTTACCC 
TCGAAATGGG 


AT'CAGATGGA 
TAATCTACCT 


TAACATGGAT 
ATTGTACCTA 


GACTTATTTA 
CTGAATAAAT 


GA.3\GATCCAA 
CTTCTAGGTT 


7201 


GAATCCAATT 
CTTAGGTTAA 


TCATTA.TGAT 
AGTA21TACTA 


AAAACCTTAC 
TTTTGGAATG 


CAGAACTTAA 
GTCTTGAATT 


GCATATTCCA 
CGTATAAGGT 


7251 


GATGTATATA 
CTACATATAT 


CATCTAGTCA 
GTAGATCAGT 


GTCTCCTGTT 
CAGAGGACAA 


AAACATCCTT 
TTTGTAGGAA 


CTCAATATGA 
GAGTTATACT 


7301 


AGGAGTGTTT 
TCCTCACAAA 


TATACTGATG 
ATATGACTAC 


GCTCGGCCAT 
CGAGCCGGTA 


CAAAAGTCCT 
GTTTTCAGGA 


GATCCTACAA 
CTAGGATGTT 


7351 


AAAGCAATAA 
TTTCGTTATT 


TGCTGGCATG 
ACGACCCTAC 


GGAATAGTAC 
CCTTATCATG 


ATGCCACATA 
TACGGTGTAT 


CAAACCTGAA 
GTTTGGACTT 


7401 


TATCAAGTTT 
ATAGTTCAAA 


TGAATCAATG 
ACTTAGTTAC 


GTCAATACCA 
CAGTTATGGT 


CTAGGTAATC 
GATCCATTAG 


ATACTGCTCA 
TATGACGAGT 


7451 


GATGGCTGAA 
CTACCGACTT 


ATAGCTGCAG 
TATCGACGTC 


TTGAATTTGC 
AACTTAAACG 


CTGTAAAAAA 
GACATTTTTT 


GCTTTAAAAA 
CGAAATTTTT 


7501 


TACCTGGTCC 
ATGGACCAGG 


TGTATTAGTT 
ACATAATCAA 


ATAACTGATA 
TATTGACtAT 


GTTTCTATGT 
CAAAGATACA 


AGCAGAAAGT 
TCGTCTTTCA 


7551 


GCTAATAAAG 
CGATTATTTC 


AATTACCATA 
TTAATGGTAT 


CTGGAAATCT 
GACCTTTAGA 


AATGGGTTTG 
TTACCCAAAC 


TTAATAATAA 
AATTATTATT 
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?tt??t?ggI all^^n^lt ^"C"*ATG GAAATCTATT GCTGAGTGTT 
CTTTTTCGGA GAATTTGTAT AGAGGTTTAC CTTTAGATAA CGACTCACAA 

7 6 51 TATCTATGAA ACCAGACATT ACTATTCAAC ATGAAAAAGG CATCAGCCTA 
TGGTCTGTAA TGATAAGTTG TACTTTTTCC GTaS^GgI? 

S™?GGTC Irllr^l^nl °*^<5GCAAT GCCCTAGCAG 'aT^GcVtg'c 

gtttatggtc ataagtatga ctttccgtta cgggatcgtc .TATTCGAACG 

AGTTATGTGG TTAATTGTAA TACCAAAAAA CCaITcctgG 
GTGGGTTCCT TCAATACACC AATTAACATT ATG6TTTTTT GGTTTGGACC 

f:!^"^*^'^^ GGATCAATTA TTACAGGGTC ATTATATAAA AGGATATCCC 
rfffj"^"^ CCTAGTTAAT AATGTCCCAG TAATATATTT TCCTATAGGG 

^^t^^tl^"^^ CATATTTTTT AGAAGATGGC AAAGTAAAAG TTTCCAGa'cc 
rjjfl!*!^!.:.'?'''^'^ TCTTCTACCG TTTCATTTTC AAAGGTCTgS 

IS^^o^S^"^^ aaaattattc cccctcagtc acacagacaa aaaa'ttgtgc 

ACTTCCCCAA TTTTAATAAG GGGGAGTCAG TCTGTCTGTT TTTTAACACG 

^^^^ ll^tt'Z^^^'' CAATTTGGCT CACACCGGAC GTGAAGCCAC TCTTTTA^ 
AAGTTCGGGT GTTAAACCGA GTGTGGCCTG CACTTCGGTG AGAAAATT?? 

8 00x ATTGCCAACC TTTATTGGTG GCCAAATATG AGAAAGGATG TGGTTAAACA 
TAACGGTTGG AAATAACCAC CGGTTTATAC TCTTTCCTAC ACCAATTTGT 

80S1 actaggacgc tgtcaacagt gtttaatcac aaatgcttcc aacaaag'cct 
rffj.':^.^!^'^- acagttgtca caaattagtg tttacqaagg ttgtJ??gS 

tctaagacca c-ataggcctc aaaaaccttt tgataaattc 

_. GACCAGGATA AGATTCTGGT CTATCCGGAG TTTTTGGAAA ACTATTTAaS 

81Si TTTATTGACT ATATTGGACC TTTGCCACCT TCACAGGGAT ACCTATATGT 
TATAACCTGG AAACGGTGGA AGTGTCCCTA TGgIJItIcI 

82 01 attagtagtt gttgatggaa tgacaggatt cacttggtta tacccca'ctI 
!l*7.?.!!^.f/^"*^"' actgtcctaa gtgaaccIat I?ggSg?gI? 

8 2 51 AGGCTCCTTC TACTAGCGCA ACTGTTAAAT CTCTCAATGT ACTCACTArx 
TGACAATTTA GAGAGTtI?! JgISgIt^I 

8 3 01 ATTGCAATTC CAAAGGTGAT TCACTCTGAT CAAGGTGCAG ''ATTCACTTr 
r^'^f.^.I^l.^.?// AGTGAGACTA GTTCCA^^J? gJIIgJgIIg 

llr^n^r'' G^TC^TGGG CAAAGGAAAG AGGTATACAT TTGGAaVtcA 
AAGTTGGAAA CGACTTACCC GTTTCCTTTC TCCATATGTA AACCTTAAGT 

84 01 GTACTCCTTA TCACCCCCAA A6TGGTAGTA AGGTGGAAAG Gliki^TAGT 
CATGAGGAAT AGTGGGGGTT TCACCATCAT TCCACCTTTC CTTTtJS^J 

lilil^rrG cralHl^^^ taaactgcta gtaggaagac'ccaca'aagtg 

CTGAAAATTG ATTTGACGAT CATCCTTCTG GGTGTTTCAC 

8 5 01 GTATGACCTA TTGCCTGTTG TACAACTTGC TTTAAACAAC ACCTATAGCC 
CATACTGGAT AACGGACAAC ATGTTGAACG AAATTTGTTG TGGATATCGG 
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8 551 CTGTATTAAA ATATACTCCA CATCAACTCT TATTTGGTAT AGATTCAAAT 
GACATAATTT TATATGAGGT GTAGTTGAGA ATAAACCATA TCTAAGTTTA 

8 601 ACTCCATTTG CAAATCAAGA TACACTTGAC TTGACCAGA6 AAGAAGAACT 
TGAGGTAAAC GTTTAGTTCT ATGTGAACTG AACTGGTCTC TTcJtcJJgI 

8 0-51 TTCTCTTTTA CAGGAAATTC GTACTTCTTT ATACCATCCA TCCACCCCTC 

aagagaaaat gtcctttaag catgaagaaa tatggtaggt aggtggggag 

8701 cagcctcctc tcgttcctgg tctcctgttg ttggccaatt ggtccaggag 
gtcggaggag agcaaggacc agaggacaac aaccggttaa ccaggtcctc 

'■'^^ ft°°?I^^*^^^ ggcctgcttc tttgagacct cgttggcata aaccgtctac 
T^^^ACCGAT ccggacgaag aaactctgga gcaaccgtat ttggcagatg. 

8 801 tgtacttaag gtgttgaatc caaggactgt tgttattttg gaccatcttg 
\ acatgaattc cacaacttag gttcctgaca acaataaaac ctggtagaac 

I 88 51 gcaacaacag aactgtaagt atagataatt taaaacctac ttctcatcag 
I cgttgttgtc ttgacattca tatctattaa attttggatg aagagtagtc 

I 8 901 aatggcacca ccaatgacac tgcaacaatg gatcatttgg aaaaaaatga 

ttaccgtggt ggttactgtg acgttgttac ctagtaaacc tttttttact 

^^^^ ^'itf^'l^'l^^ tgaggcactt caaaatacaa caactgtgac tgaacagcag 

TATTTCGCGT ACTCCGTGAA GTTTTATGTT GTT6ACACTG ACTTGTCGTC 

9001 AAGGAACAAA TTATACTGGA CATTCAAAAT GAAGAAGTAC AACCAACTAG 
TTCCTTGTTT AATATGACCT GTAAGTTTTA CTTCTTCATG TTGGTTGATC 

9051 GAGAGATAAA TTTAGATATC TGCTTTATAC TTGTTQTGCT a'cTAGCt-CAA 
<="<=TCTATTT AAATCTATAG ACGAAATATG AACAACACGA TGATCGAGTT 

9101 GAGTATTGGC CTGGATGTTT TTAGTTTGTA TATTGTTAAT CATTGTTTTG 
CTCATAACCG C-ACCTACAAA AATCAAACAT ATAACAATTA GTAACAAAAC 

9151 GTTTCATGCT TT6TGACTAT ATCCAGAATA CAATGGAATA AGGATATTCA 
CAAAGTACGA AACACTGATA TAGGTCTTAT GTTACCTTAT TCCTATAAGT 

92 01 GGTAT.TAGGA CCTGTAATAG ACTGGAATGT TACTCAAAGA GC^GT'T'TATC 
CC.ATAATCCT GGACATTATC TGACCTTACA ATGAGTTTCT CGACAAATAG 

92 51 AACCCTTACA 6A.CTAGAAGG ATTGCACGTT CCCTTAGAAT GCAGCATCCT 
,':JJ^-^t":':''. ^"^^^^^^^^ TAACGTGCAA GGGAATCTTA cScSIggI 

93 01 GTTCCAAAAT ATGTGGAGGT AAATATGACT AGTATTCCAC AAGGTGTATA 
CAAGGTTTTA TACACCTCCA TTTATACTGA TCATAAGGTG TTCCACATAT 

CATCCGGAAC CCATAGTGGT GAAGGAGAGG GTCCTAGGTC 
GATACTTGGG GTAGGCCTTG GGTATCACCA CTTCCTCTCC CAGGATCCAG 

Ulnln^tl TCTGATGATT AATTCAGAAA ACATTGCTAA TAAtVcVaIt 
AAAGAGTTTA AGACTACTAA TTAAGTCTTT TGTAACGATT ATTACGATTA 

94 51 TTGACACAAG AAGTAAAGAA GTTGTTAACT GAAATGGTTA ATGAAGAAAT 
AACTGTGTTC TTCATTTCTT CAACAATTGA CTTTACCAAT TACTTCTTTA 
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9 501 GCAAAGTTTG TCAGATGTAA TGATTGACTT TGAAATTCCT TTAGGAGACC 
CGTTTCAAAC AGTCTACATT ACTAACTGAA ACTTTAAGGA AATCCTCTGG 



9 5 51 CTCGTGATCA AGAACAATAT ATACATAGAA AATGCTATCA AGAATTTGCA 
GAGCACTAGT TCTTGTTATA TATGTATCTT TTACGATAGT TCTTAAACGT 



96 01 AATTGTTATT TAGTAAAATA TAAAGAACCC AAACCGTGGC CTAAGGAGGG 
TTAACAATAA ATCATTTTAT ATTTCTTGGG TTTGGCACCG GATTCCTCCC 



9651 CCTTATAGCT GATCAATGCC CATTACCAGG TTACCATGCT GGATTAACCT 
GGAATATCGA CTAGTTACGG GTAATGGTCC AATGGTACGA CCTAATTGGA 



97 01 ATAATAGACA GTCTATTTGG GATTACTATA TTAAAGTGGA GAGTATTAGA 
TATTATCTGT CAGATAAACC CTAATGATAT AATTTCACCT CTCATAATCT 



9 7 51 CCTGCAAATT GGACAACAAA GAGTAAATAT GGACAAGCTA GACTAGGAAG 
GGACGTTTAA CCTGTTGTTT CTCATTTATA CCTGTTCGAT CTGATCCTTC 



9 8 01 TTTTTATATT CCTAGCAGCC TGAGACAAAT CAATGTTAGT CATGTACTAT 
AAAAATATAA GGATCGTCGG ACTCTGTTTA GTTACAATCA GTACATGATA 



TCTGTAGTGA TCAATTATAT TCTAAATGGT ATAATATAGA AAATACCATA 
AGACATCACT AGTTAATATA AGATTTACCA TATTATATCT TTTATGGTAT 



9 9 01 GAACAAAACG AGCGGTTTCT GCTTAATAAA CTAAATAACC TTACATCTGG 
CTTGTTTTGC TCGCCAAAGA CGAATTATTT GATTTATTGG AATGTAGACC 



9 9 51 AACCTCAGTA TTGAAGAAAA GAGCTCTTCC GAAGGATTGG AGTTCTCAAG 
TTGGAGTCAT AACTTCTTTT CTCGAGAAGG -CTTCCTAACC TCAAGAGTTC 



100 01 GTAAAAATGC TCTGTTTAGA GAAATCAATG TGTTAGATAT CTGCAGTAAA 
CATTTTTACG AGACAAATCT CTTTAGTTAC ACAATCTATA GACGTCATTT 



10051 CCTGAATCTG TAATACTATT GAATACTTCA TACTATTCCT TC^CTTTATG 
GGA.CTTAGAC ATTATGATAA CTTATGAAGT ATGATAAGGA AGAGAAATAC 



.10101 GGAAGGAGAT TGTAATTTTA CTAAAGATAT GATTTCTCAG TTGGTTCCAG 

CCTTCCTCTA ACATTAAAAT GATTTCTATA CTAAAGAGTC AACCAAGGTC 

mi 51 AATGTGATGG ATTTTATAAC AATTCTAAGT GGATGCATAT GCATCCAT-aT 

TTACACTACC TAAAATATTG TTAAGATTCA CCTACGTATA CGTAGGTATA 



102 01 GCTTGTAGAT TCTGGAGAAG TAAGAAGAAT GAAAAAGAAG AAACTAAATG 
CGAACATCTA AGACCTCTTC ATTCTTCTTA CTTTTTCTTC TTTGATTTAC 



10251 TAGAGATGGG GAAACTAAGA GATGTCTGTA TTATCCTTTA TGGGACAGTC 
•ATCTCTACCC CTTTGATTCT CTACAGACAT AATAGGAAAT ACCCTGTCAG 



10301 CCGAATCTAC ATATGATTTT GGTTATTTAG CATACCAAAA GAATTTTCCT 
GGCTTAGATG TATACTAAAA CCAATAAATC GTATGGTTTT CTTAAAAGGA 



103 51 TCCCCTATCT GTATAGAACA ACAGAAAATT AGAGATCAAG ATTATGAAGT 
AGGGGATAGA CATATCTTGT TGTCTTTTAA TCTCTAGTTC TAATACTTCA 



10401 CTATTCTTTG TATCAAGAAC GCAAAATAGC TTCTAAAGCA TATGGAATTG 
GATAAGAAAC ATAGTTCTTG CGTTTTATCG AAGATTTCGT ATACCTTAAC 
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104 51 ATACAGTTTT ATTCTCTCTA AAGAATTTTC TTAATTATAC AGGAACTCCT 
TATGTCAAAA TAAGAGAGAT TTCTTAAAAG AATTAATATG TCCTTGAGGA 



10 501 GTAAATGAAA TGCCTAATGC AAGAGCTTTT GTAGGCCTAA TAGATCCCAA 
CATTTACTTT ACGGATTACG TTCTCGAA^A CATCCGGATT ATCTAGGGTT 



10 551 GTTTCCTCCT TCCTATCCCA ATGTTACTAG GGAACATTAT ACTTCCTGTA 
CAAAGGAGGA AGGATAGGGT TACAATGATC CCTTGTAATA TGAAGGACAT 



10 601 ATAATAGGAA AAGAAGAAGT GTTGATAATA ACTATGCTAA GTTAAGGTCT 
TATTATCCTT TTCTTCTTCA CAACTATTAT TGATACGATT CAATTCCAGA 



10 6 5?. ATGGGGTATG CACTTACAGG AGCAGTGCAA ACCTTATCTC AAATATCAGA 
TACCCCATAC GTGAATGTCC TCGTCACGTT TGGAATAGAG TTTATAGTCT 



107 01 TATTAATGAT GAAAACTTAC AGCAAGGAAT ATATTTATTA AGGGATCATG 
ATAATTACTA CTTTTGAATG TCGTTCCTTA TATAAATAAT TCCCTAGTAC 



10 751 TAATAACCTT AATGGAAGCT ACATTGCATG ATATATCTGT TATGGAAGGA 
ATTATTGGAA TTACCTTCGA TGTAACGTAC TATATAGACA ATACCTTCCT 



108 01 ATGTTTGCTG TACAACATTT GCATACACAT TTGAATCATT TGAAGACAAT 
TACAAACGAC ATGTTGTAAA C6TATGTGTA AACTTAGTAA ACTTCTGTTA 



108 51 GCTTCTAGAA AGAAGAATAG ACTGGACCTA TATGTCTAGT ACTT-GGCTAC 
CGAAGATCTT TCTTCTTATC TGACCTGGAT ATACAGATCA TGAACCGATG 

10 9 01 AACAACAATT ACAGAAATCT GATGATGAGA TGAAAGTAAT AAAGAGAATT 
TTGTTGTTAA TGTCTTTAGA CTACTACTCT ACTTTCATTA TTTCTCTTAA 



10 951 GCTAGAAGTT TGGTATATTA TGTTAAACAA ACCCATAGTT CTCCCACAGC 
CGATCTTCAA ACCATATAAT ACAATTTGTT TGGGTATCAA GAGGGTGTCG 



11001 TACAGCCTGG GAGATTGGAT TATATTATGA ATTGGTTATA CCTAAACATA 
ATGTCGGACC CTCTAACCTA ATATAATACT TAACCAATAT GGATTTGTAT 



11051 TTTACTTGAA TAATTGGAAT GTTGTCAATA TAGGTCACTT AGTTAAATCA 
AAATGAACTT ATTAACCTTA CAACAGTTAT ATCCAGTGAA TCAATTTAGT 



11101 GCTGGACAAT TGACTCATGT AACTATAGCT CATCCTTATG AAATAAT'-AA 
CGACCTGTTA ACTCAGTACA TTGATATCGA GTAGGAATAC TTTATTAGTT 



11151 TAAGGAATGT GTAGAGACTA TATATCTGCA TCTTGAGGAC TGCAPAAGAC 
ATTCCTTACA CATCTCTGAT- ATATAGACGT AGAACTCCTG ACGTGTTCTG 



11201 AAGATTATGT CATATGTGAT GTGGTAAAGA TAGTGCAGCC TTGTGGCAAT 
TTCTAATACA GTATACACTA CACCATTTCT ATCACGTCGG AACACCGTTA 



112 51 AGCTCAGACA CGAGTGATTG TCCTGTCTGG GCTGAAGCTG TAAAAGAACC 
TCGAGTCTGT GCTCACTAAC AGGACAGACC CGACTTCGAC ATTTTCTTGG 



11301 ATTTGTGCAA GTCAATCCTC TGAAAAACGG AAGTTATCTG GTTTTGGCAA 
TAAACACGTT CAGTTAGGAG ACTTTTTGCC TTCAATAGAC CAAAACCGTT 



113 51 GTTCCACAGA CTGTCAGATC CCACCATATG TTCCTAGCAT CGTGACTGTT 
CAAGGTGTCT GACAGTCTAG GGTGGTATAC AAGGATCGTA GCACTGACAA 
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114 01 AATGAAACAA CGTCATGCTT TGGACTGGAC TTTAAAAGGC CACTGGTTGC 
TTACTTTGTT GCAGTACGAA ACCTGACCTG AAATTTTCCG GTGACCAACG 



114 51 GGAAGAAAGA TTGAGCTTTG AGCCACGACT GCCAAATCTA CAACTAAGAT 
CCTTCTTTCT AACTCGAAAC TCGGTGCTGA CGGTTTAGAT GTTGATTCTA 



115 01 TACCACATTT GGTTGGAATT ATTGCAAAAA TCAAAGGGAT AAAAATAGAA 
ATGGTGTAAA CCAACCTTAA TAACGTTTTT AGTTTCCCTA TTTTTATCTT 



11551 GTCACATCCT CTGGAGAAAG TATAAAAGAG CAGATTGAAA GAGCAAAAGC 
CAGTGTAGGA GACCTCTTTC ATATTTTCTC GTCTAACTTT CTCGTTTTCG 



11601 TGAGCTCCTT CGACTGGACA TTCACGAGGG AGATACTCCT GCCTGGATAC 
ACTCGAGGAA GCTGACCTGT AAGTGCTCCC TCTATGAGGA CGGACCTATG 



11651 AACAGCTAGC TGCAGCAACA AAGGACGTCT GGCCAGCAGC AGCTTCTGCT 
TTGTCGATCG ACGTC6TTGT TTCCTGCAGA CCGGTCGTCG TCGAAGACGA 



117 01 CTACAAGGAA TTGGTAACTT TTTATCTGGG ACTGCCCAAG GAATATTTGG 
GATGTTCCTT AACCATTGAA AAATAGACCC TGACGGGTTC CTTATAAACC 



117 51 AACTGCCTTT AGTCTCTTGG GATACTTAAA GCCTATCCTA ATAGGAGTAG 
TTGACGGAAA TCAGAGAACC CTATGAATTT CGGATAGGAT TATCCTCATC 



118 01 GGGTCATTCT CTTGGTTATT CTTATATTTA AGATTGTATC ATGGATTCCT 
CCCAGTAAGA GAACCAATAA GAATATAAAT TCTAACATAG TACCTAAGGA 



118 51 ACGAAAAAGA AGAATCAGTA GCCTCCACCT CTGGAATTCA AGACCTGCAG 
TGCTTTTTCT TCTTAGTCAT CGGAGGTGGA GACCTTAAGT TCTGGACGTC 



11901 ACTCTGAGTG AGCTTGTTGG TCCTGAAAAT GCCGGAGAGG GAGAGCTGAC 
TGAGACTCAC TCGAACAACC AGGACTTTTA CGGCCTCTCC CTCTCGACTG 



11951 TATTGCTGAG GAACCTGAAG AAAATCCTCG ACGCCCCAGA CGATATACTA 
ATAACGACTC CTTGGACTTC TTTTAGGAGC TGCGGGGTCT GCTATATGAT 



12 001 AAAGAGAAGT CAAATGTGTG TCTTATCATG CATATAAAGA AATTGAGGAC 
TTTCTCTTCA GTTTACACAC AGAATAGTAC GTATATTTCT TTAACTCCTG 



12 0 51 AAACATCCTC AACATATTAA ACTGCAGGAT TGGATCCCCA CACCAGAGGA 
TTTGTAGGAG TTGTATAATT TGACGTCCTA ACCTAGGGGT GTGGTCTCCT 



12101 AATGAGTAAG TCACTCTGTA AAAGACTTAT TTTATGTGGA TTGTATAGTG 
TTACTCATTC AGTGAGACAT TTTCTGAATA AAATACACCT AACATATCAC 



1215a CAGAAAAGGC CTCAGAGATT TTAAGGATGC CTTTTACAGT TTCTTGGGAA 
GTCTTTTCCG GAGTCTCTAJ\ AATTCCTACG GAAAATGTCA AAGAACCCTT 



12201 CAATCAGATA CTGACCCTGA CTGTTTTATT GTAAGCTATA CATGTATATT 
GTTAGTCTAX GACTGGGACT GACAAAATAA CATTCGATAT GTACATATAA 



12251 TTGTGATGCT GTAATACATG ATCCCATGCC CATAAGATGG GATCCTGAAG 
AACACTACGA CATTATGTAC TAGGGTA'CGG GTATTCTACC CTAGGACTTC 



12 3 01 TTGGAATTTG ggtaaaatat aaacccctca gaggaattgt tggatctgct 

AACCTTAAAC CCATTTTATA TTTGGGGAGT CTCCTTAACA ACCTAGACGA 
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12351 


GTGTTTATTA 
CACAAATAAT 


TGCATAAACA 
ACGTATTTGT 


TCAAAGAAAC 
AGTTTCTTTG 


TGTTCTCTTG 
ACAAGAGAAC 


TTAAACCTTC 
AATTTGGAAG 


12401 


TACCAGTCGC 
ATGGTCAGCG 


TCAGAAGGTC 
AGTCTTCCAG 


CAAAACCAAG 
GTTTTGGTTC 


ACCTAGGCAC 
TGGATCCGTG 


GATCCTGTCC 
CTAGGACAGG 


12451 


TTCGATGTGA 
AAGCTACACT 


CATGTTTGAA 
GTACAAACTT 


AAGCATCACA 
TTCGTAGTGT 


AGCCTCGGCA 
TCGGAGCCGT 


GAAACGACCC 
CTTTGCTGGG 


12501 


AGGAGACGAT 
TCCTCTGCTA 


CCATCGATAA 
GGTAGCTATT 


TGAGTCATGT 
ACTCAGTACA 


GCTTCCAGTA 
CGAAGGTCAT 


GTGACACCAT 
CACTGTGGTA 


12551 


GGCCAATGAG 
CCGGTTACTC 


CCAGGATCAC 
GGTCCTAGTG 


TATGCACCAA 
ATACGTGGTT 


CCCTCTTTGG 
GGGAGAAACC 


AATCCTGGAC 
TTAGGACCTG 


1 12601 


CGCTACTATC 
GCGATGATAG 


AGGGCTACTT 
TCCCGATGAA 


GAAGAGTCCA 
CTTCTCAGGT 


GCAACCTACC 
CGTTGGATGG 


AAACTTGGAA 
TTTGAACCTT 


12651 


GTTCACATGT 
CAAGTGTACA 


CAGGTGGACC 
GTCCACCTGG 


CTTCTGGGAA 
GAAGACCCTT 


GAGGTTTATG 


GGGACTCAAT 


12701 


TTTGGGTCCC 
AAACCCAGGG 


CCCTCTGGGT 
GGGAGACCCA 


CAGGTGAACA 
GTCCACTTGT 


TTCAGTTTTA 
AAGTCAAAAT 


TAAGAATTAT 
ATTCTTAATA 


12751 

i 


CAGATTCTAA 
GT CT AAGATT 


CTTGCTGTCA 
GAACGACAGT 


GGCTGTAGAT 
CCGACATCTA 


CTTCTTAAGC 
GAAGAATTCG 


TTGCGGGAGA 
AACGCCCTCT 


1 

! 




Apal 






j 12801 


CGTCGAGTCC 
GCAGCTCAGG 


AACCCTGGGC 
TTGGGACCCG 


CCGATATCCC 
GGCTATAGGG 


CATGGTGAGC 
GTACCACT.CG 


AAGGGCGAGG 
TTCCCGCTCC 


i 12851 
1 


AGCTGTTCAC 
TCGACAAGTG 


CGGGGTGGTG 
GCCCCACCAC 


CCCATCCTGG 
GGGTAGGACC 


TC6AGCTGGA 
AGCTCGACCT 


CGGCGACGTA 
GCCGCTGCAT 


j IZ901 

I * 


AACGGCCACA 
TTGCCGGTGT 


AGTTCAGCGT 
TCAAGTCGCA 


Gl^CCGGCGAG 
CAGGCCGCTC 


GGCGAGGGCG 
CCGCTCCCGC 


ATGCCACCTA 
TACGGTGGAT 


12951 

i 
I 


CGGCAAGCTG 
GCCGTTCGAC 


ACCCTGAAGT 
TGGGACTTCA 


TCATCTGCAC 
AGTAGACGTG 


CACCGGCAAG 
GTGGCCGTTC 


CTGCCCGTGC 
GACGGGCACG 


13001 

1 
[ 


CCTGGCCCAC 
GGACCGGGTG 


CCTCGTGACC 
aCAGCACTGG 


ACCTTCACCT 
TGGAAGTGGA 


ACGGCGTGCA 
TGCCGCACGT 


GTGCTTGAGC 
CACGAAGTCG 


1 

; 15051 


CGCTACCCCG 
GCGATGGGGC 


ACCACATGAA 
TGGTGTACTT 


GCAGCACGAC 
CGTCGTGCTG 


TTCTTCAAGT 
AAGAAGTTCA. 


CCGCCATGCC 
GGCGGTACGG 


13101 


CGAAGGCTAC 
GCTTCCGATG 


GTCCAGGAGC 
CAGGTCCTCG 


GCACCATCTT 
CGTGGTAGAA 


CTTCAAGGAC 
GAAGTTCCTG 


GACGGCAACT 
CTGCCGTTGA 


13151 


ACAAGACCCG 
TGTTCTGGGC 


CGCCGAGGTG 
GCGGCTCCAC 


AAGTTCGAGG 
TTCAAGCTCC 


GCGACACCCT 
CGCTGTGGGA 


GGTGAACCGC 
CCACTTGGCG 


13201 


ATCGAGCTGA 
TAGCTCGACT 


AGGGCATCGA 
TCCCGTAGCT 


CTTCAAGGAG 
GAAGTTCCTC 


GACGGCAACA 
CTGCCGTTGT 


TCCTGGGGCA 
AGGACCCCGT 


13251 


CAAGCTGGAG 
GTTCGACCTC 


TACAACTACA 
ATGTTGATGT 


ACAGCCACAA 
TGTCGGTGTT 


CGTCTATATC 
GCAGATATAG 


ATGGCCGACA 
TACCGGCTGT 
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13301 AGCAGAAGAA CGGCATCAAG GTGAACTTCA AGATCCGCCA CAACATCGAG 
TCGTCTTCTT GCCGTAGTTC CACTTGAAGT TCTAGGCGGT GTTGTAGCTC 



13 3 51 GACGGCAGCG TGCAGCTCGC CGACCACTAC CAGCAGAACA CCCCCATCGG 
CTGCCGTCGC ACGTCGAGCG GCTGGTGATG GTCGTCTTGT GGGGGTAGCC 



13401 CGACGGCCCC GTGCTGCTGC CCGACAACCA CTACCTGAGC ACCCAGTCCG 
GCTGCCGGGG CACGACGACG GGCTGTTGGT GATGGACTCG TGGGTCAGGC 



13 4 51 CCCTGAGCAA AGACCCCAAC GAGAAGCGCG ATCACATGGT CCTGCTGGAG 
GGGACTCGTT TCTGGGGTTG CTCTTCGCGC TAGTGTACCA GGACGACCTC 



13 5 01 TTCGTGACCG CCGCCGGGAT CACTCTCGGC ATGGACGAGC TGTACAAGTA 
AAGCACTGGC GGCGGCCCTA GTGAGAGCCG TACCTGCTCG ACATGTTCAT 



Nod 



13 5 51 AAGCGGCCGC GACTCTAGGG GATTCGCJAT AAGTAAGTAA GCTTATGGAC 
TTCGCCGGCG CTGAGATCCC CTAAGCGCTA TTCATTCATT CGAATACCTG 



13 601 CTCAGAGAGG AAGTAACGAG GAGAGGGTGT GGTGGAATGT CACTAGAAAC 
GAGTCTCTCC TTCATTGCTC CTCTCCCACA CCACCTTACA GTGATCTTTG 



13 651 CAGGGAAAAC AAGGAGGAGA GTATTACAGG GAAGGAGGTG AAGAACCTCA 
GTCCCTTTTG TTCCTCCTCT CATAATGTCC CTTCCTCCAC TTCTTGGAGT 



13 701 TTACCCAAAT ACTCCTGCTC CTCATAGACG TACCTGGGAT GAGAGACACA 
AATGGGTTTA TGAGGACGAG GAGTATCTGC ATGGACCCTA CTCTCTGTGT 



13 751 AGGTTCTTAA ATTGTCCTCA TTCGCTACTC CCTCTGACAT CCAACGCTGG 
TCCAAGAATT TAACAGGAGT AAGCGATGAG GGAGACTGTA GGTTGCGACC 



13 8 01 GCTACTAAAG CATTGCCTTA TGGCTGGAAA GTGGTCACCG AAAGCGGAAA 
CGATGATTTC GTAACGGAAT ACCGACCTTT CACCAGTGGC TTTCGCCTTT 



13 851 * TGATTATACT AGCCGCAGAA AGATCAGAAC ATTGACAGAG ATGACTCAGG 
ACTAATATGA TCGGCGTCTT TCTAGTCTTG TAACTGTCTC TACTGAGTCC 



13 901 ATGAAATTAG AAAAAGGTGG GAAAGTGGAT ATTGTGACCC CTTCATTGAC 
TACTTTAATC TTTTTCCACC CTTTCACCTA TAACACTGGG GAAGTAACTG 



13 951 TCAGGAAGT3 ACTCAGATGG ACCCTTCTAA AAGCCACAGA CAGTAAAAAT 
AGTCCTTCAC TGAGTCTACC TGGGAAGATT TTCGGTGTCT GTCATTTTTA 



14001 GTGTTAGCAC TTTATACAAT ATTATATCTG CTTAAGCTAT AGAAGCTTTC 
CACAATCGTG AAATATGTTA TAATATAGAC GAATTCGATA TCTTCGAAAG 



140 51 ACATACTCAG TAGCTGTTTC ACAATCAACA AAACAATGAT GATGTAATCA 
TGTATGAGTC ATCGACAAAG TGTTAGTTGT TTTGTTACTA CTACATTAGT 



14101 TAAGGAAGTA GTTTAAATAG GTTAATAAGT TTATTAGTTA TATAGAAAAT 
ATTCCTTCAT CAAATTTATC CAATTATTCA AATAATCAAT ATATCTTTTA 



14151 AATATAGGAT AAAGTATAAG GATTAAGGTA TGAGGTGTGT GGCTCAACAC 
TTATATCCTA TTTCATATTC CTAATTCCAT ACTCCACACA CCGAGTTGTG 



142 01 GTAGGGTGAC AAGAAAATCT ACTGTAATAG GACACAACAC CTCTAAAGTT 
CATCCCACTG TTCTTTTAGA TGACATTATC CTGTGTTGTG GAGATTTCAA 
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SEQ ID NO: 2 

a tgggtgcgag agcgtcggta ttaagcgggg gagaattaga 
taaatgggaa 

841 aaaattcggt taaggccagg gggaaagaaa caatataaac taaaacatat 
agtatgggca 

901 agcagggagc tagaacgatt cgcagttaat cctggccttt tagagacatc 
agaaggctgt 

961 agacaaatac tgggacagct acaaccatcc cttcagacag gatcagaaga 
acttagatca 

1021 ttatataata caatagcagt cctctattgt gtgcatcaaa ggatagatgt 
aaaagacacc 

1081 aaggaagcct tagataagat agaggaagag caaaacaaaa gtaagaaaaa 
ggcacagcaa 

1141 gcagcagctg acacaggaaa caacagccag gtcagccaaa attaccctat 
agtgcagaac 

12 01 ctccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaatgc 
atgggtaaaa 

12 61 gtagtagaag agaaggcttt cagcccagaa gtaataccca tgttttcagc 
attatcagaa 

1321 ggagccaccc cacaagattt aaataccatg ctaaacacag tggggggaca 

tcaagcagcc 

13 81 atgcaaatgt taaaagagac catcaatgag gaagctgcag aatgggatag 
attgcatcca 

1441 gtgcatgcag ggcctattgc accaggccag atgagagaac caaggggaag 
tgacatagca 

1501 ggaactacta gtacccttca ggaacaaata ggatggatga cacataatcc 
acctatccca 

1561 fjtaggagaaa tctataaaag atggataatc ctgggattaa ataaaatagt 
aagaatgtat 

1621 agccctacca gcattctgga cataagacaa ggaccaaagg aaccctttag 
agactatgta 

1681 gaccgattct ataaaactct aagagccgag caagcttcac aagaggtaaa 
aaattggatg 

1741 acagaaacct tgttggtcca aaatgcgaac ccagattgta agactatttt 
aaaagcattg 

1801 ggaccaggag cgacactaga agaaatgatg acagcatgtc agggagtggg 
gggacccggc 

1861 cataaagcaa gagttttggc tgaagcaatg agccaagtaa caaatccagc 
taccataatg 

1921 atacagaaag gcaattttag gaaccaaaga aagactgtta agtgtttcaa 
ttgtggcaaa 

1981 gaagggcaca tagccaaaaa ttgcagggcc cctaggaaaa agggctgttg 
gaaatgtgga 

2041 aaggaaggac accaaatgaa agattgtact gagagacagg ctaatttttt 
agggaagatc 

2101 tggccttccc acaagggaag gccagggaat tttcttcaga gcagaccaga 
gccaacagcc 

2161 ccaccagaag agagcttcag gtttggggaa gagacaacaa ctccctctca 
gaagcaggag 

2221 ccgatagaca aggaactgta tcctttagct tccctcagat cactctttgg 
cagcgacccc 
2281 tcgtcacaat aa 



SEQ ID NO: 3 

MGARASVLSGGELDKWEKmLRPGGKKQYKLKHIVWASRELERFAVNP^^^ 
TSEGCRQILGQLQPSLQTGSEELRSLYNTIAVLYCVHQRTOVKJJTKEALDKIEEE 
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QNKSKKKAQQAAADTGNNSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKV 

VEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAE 

WDRLHPVHAGPIAPGQMI^PRGSDIAGTTSTLQEQIGWMTHNPPIPVGEIYKR 

WnLGLlSIXIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMT 

ETLLVQNANPDCKTILKALGPGATLEEMMTACQGVGGPGHKARVLAEAMSQ 

VTNPATIMIQKGNFRNQRKTVKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGH 

QMKDCTERQANFLGKIWPSHKGRPGNfLQSRPEPTAPPEESFRFGEETTTPSQK 

QEPIDKELYPLASLRSLFGSDPSSQ 



SEQ ID NO: 4 

MGARASVI^GGELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLE 

TSEGCRQILGQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEE 

QNKSKKKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNAWVKV 

VEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAE 

WDRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKR 

\^^GLNKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMT 

ETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLAEAMSQ^ 

VTNSATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKGC\VKCGKBG 

HQMKDCTERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQSRPEPTAPPEES 

FRSGVETTTPSQKQEPIDKELYPLtSLRSLFGNDPSSQ ? 



SEQ ID NO: 5 

MGARASVLSGGELDR\N^KVRLRPGGKKKYKLKmVWASRELERFAVNPGLLE 

TSEGCRQILGQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEE 

QNKSKKKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNAW\T<:V 

VEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAE 

WDRVHP\^AGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKR 

WnLGLNKTVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMT 

ETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLAEAMSQ 

VTNSATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKGCWKCGKEG 

HQMKDCTERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQSRPEPTAPPEES 

FRSGVETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 



SEQ ID NO: 6 

TCC GGG CCC GGA ATG CCT ATA GTC CAG AAC ATC C 



SEQ ID NO: 7 

GCG GCC GCG TTT TGA GAA CGA AAT ACC GG 



SEQ ID NO: 8 
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SEQ ID N0:1 with SEQ ED NO: 2 inserted between 12816 and 13552. 
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LIVE REPLICATING SPUMAVIRUS VECTOR 
BACKGROUND OF THE INVENTION 

5 1 . Spumavirus, also known as foamy virus for the characteristics of 

vacuolization the virus induces in cell culture, belongs to a distinct group of 
retroviruses. The sinndan foamy viruses (SFVs) include isolates from Old World and 
New World monkeys and are classified into 10 different serotypes based on serological 
cross-reactivities. Virus appears to persist in tihte host for a long period of time in a 

10 latent fonn and can exist in the presence of neutralizing antibody. 

2. Currently the most studied retrovirus. Human Immxmodeficiency Virus, is 
believed to be derived from nonhuman primate transmission into humans at some past 
time. Concerns about the risk of transmission of retrovimses from non-human primates 
to hiraians working in research laboratories were heightened in the early 1990's when 

1 5 two persons developed antibodies to SIV (Simian Immimodeficiency Virus) following 
work-related exposures, one of whom had clear evidence of persistent viral infection, 
(See CDC anonymous survey for simian immunodeficiency virus (SIV) seropositivity 
in SrV laboratory researchers - United States, 1992. MMWR Morb. Mort. Wkly. Rep, 
1992; 41:814-5; Khabbaz R.F., et al. Brief report: infection of a laboratory worker with 

20 simian inamunodeficiency virus. New Eng. J. Med. 1994; 330:172-7; Khabbaz R.F., et 
al. Simian inmaunodeficiency virus needle stick accident in a laboratory worker. 
Lancet 1992; 340:271-3; and CDC. Guideline to prevent simian innmunodeficiency 
virus infection in laboratory workers and animal handlers. MMWR 1988; 37:693-704.) 
In addition to SIV, nonhuman primate species used in biomedical research are 

25 commonly infected with SFV (simian foamy vims), STLV (sindan t-cell lymphotrophic 
virus), and/or type D retroviruses. All of these retroviruses cause lifelong infections in 
nonhuman primates, and some are known to be transmissible through sexual contact, 
blood, or breast-feeding. Natural SFV infections in non-hiraian primates have not been 
definitively associated with disease. In non-human primates, infection with the other 

30 retroviruses may result in a clinical spectrum ranging from asymptomatic infection to 
life threatening immunodeficiency syndromes or lymphoproliferative disorders. The 
transmission routes of SFVs among non-himian primates remain undefined, but the 
prevalence of seroreactivity is high among captive adult non-himaan primates. 
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3. Recent publications indicate that earlier serological tests showing human 
spiimavirus antibodies in the human populating were incorrect Immunological 
investigation of a previously reported human spumavirus revealed that it shared 
common antigens in complement fixation, immunofluorescence and neutralization 

5 assays with the chimpanzee foamy virus, SFV-6. Furthermore, failure to detect 
serological evidence of HFV infection in people from a wide geographical area 
suggested that this virus isolate was a variant of SFV-6 particularly since sgtb, from 
chimpanzees naturally infected with SFV-6 neutralized both vimses. In a survey for 
prevalence of human foamy virus in more than 5000 human sera, collected from 

10 geographically diverse populations, none of the serum samples were confirmed as 

positive. Taken together with sequence analysis endorsing the phylogenetic closeness 
of the pvirported hxmian spumavirus to SFV-6/7, these data strongly suggest that hiunan 
foamy virus is not naturally found in the hxmian populations. (See Ali, M. et al., '"No 
Evidence of Antibody to Himian Foamy Vims in Widespread Human Populations," 

15 AIDS Research and Human Retrovkuses, Vol. 12, NO. 15, 1996). 

4. Gene therapies have long looked for a good vector that can transport the 
foreign gene of choice into hxmian cells. Thus, compositions and methods for gene 
therapy are needed that use a vector capable of carrying a significant amount of foreign 
DNA that will enter the host organism and not cause disease. 

20 5. Compositions and methods for vaccination using recombinant live 

retroviruses are also needed. A live virus, that causes no illness in humans, and that has 
genes of antigens of choice incorporated into its genome, would provide for an 
excellent vaccination tool as a vector. The retrovims would reproduce in the human 
host and expose the inmume system to antigens so that an immune response can be 

25 initiated. 

6. Targeted attack on reproducing cells is a goal of cancer treatment. What is 
needed are compositions and methods for cancer treatment that are specific for dividing 
cells tiiat do not cause systemic damage to the cancer patient. A viral vector that could 
infect and kill dividing cells, without killing other cells of the host would provide a 
30 solution for cancer treatment. 
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7. The lack of any known disease associated with the virus from which the 
vector of the present invention was derived makes the present invention ideal for gene 
therapy regimens. 

SUMMARY OF THE INVENTION 

5 8. In accordance with the purposes of this invention, as embodied and broadly 

described herein, this invention, in one aspect, relates to live replicating retroviral 
vectors and methods of their use. 

9. Additional advantages of the invention will be set forth in part in the 
description which follows, and in part will be obvious from the description, or may be 

10 leamed by practice of the invention. The advantages of the invention will be realized 
and attained by means of the elements and combinations particularly pointed out in the 
appended claims. It is to be understood that both the foregoing general description and 
the following detailed description are exemplaiy and explanatory only and are not 
restrictive of the invention, as claimed. 

15 10, The present invention provides compositions comprising live replicating p 

retroviral vectors, wherein the vector is derived from a spumavirus, and wherein the 
vector ftirther comprises a nucleic acid that encodes a non-spxraiavirus peptide, 
polypeptide, or protein. Thus, the present invention provides compositions comprising 
live replicating spumavirus vectors. 

20 11. Also provided by the present invention is a method of treating a subject with . 

a condition, wherein the condition can be a viral infection, bacterial infection, parasitic 
infection, proliferative disorder (eg. cancer), or a condition associated with a genetic or 
autoimmune disorder; comprising administering to the subject a live replicating viral 
vector, wherein the immunizing construct is specific for the condition. 

25 12. Also provided by the present invention is a method of preventing a 

condition in a subject, wherein the condition can be a viral infection, bacterial infection, 
parasitic infection, proliferative disorder, or a condition associated wilh a genetic or 
autoimmune disorder; comprising administering to the subject a live replicating viral 
vector, wherein the antigen-encoding nucleic acid is specific for the condition. 

30 13. Also provided are methods of using the present vector for making models 

and using niodels to study diseases and potential treatments, as well as the models 
themselves. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

14. The accompanying drawings, which are incorporated in and constitute a part 
of this specification, illustrate several embodiments of the invention and together with 
the description, serve to explain the principles of the invention. 
5 15. Figure 1 shows a map of the pHSV vector detailing the location of the 

spumavirus env, pol, gag, and beUl (tas) genes. The map also indicates the presence of 
LTR flanking the coding region for the spumavirus genes and the presence and location 
of Apal andNotl restriction sites. 

16. Figure 2 shows a detailed restriction map of the pHSV vector. The 
10 sequence of the vector is provided in SEQ ID NO: 1 . 

17. Figure 3 shows a linear map of pHSV with the pl7/24 sequence 
incorporated in the vector to generate a pHSV-HIV-gag pl7/24 vector. This shows the 
location of pi 7/24 within the vector. 

18. Figure 4 shows that several TA-pl7/24 clones have the desired length 
15 fragment when cut with the Apal and Notl restriction enzymes. Shown is a 1% 

Acrylamide gel of purified clones resulting the transfection of pl7/24 uito the TA 
cloning vector (pCR2.1). Lane 1 is the marker BSTE H. Lane 2 is an empty pHSV 
vector used as a negative control. Lanes 3-7 are clones 1-5. 

19. Figure 5 shows that p24 expression can be measured by ELISA in clones 2, 
20 3, and 4 at days 2, 4, and 6 days post infection of BHK cells. Samples were also 

measured for p24 expression 6 days post-infection following fi:eeze/thaw (F/T) of the 
clones. 

20. Figure 6 shows that p24 expression is pronounced 5 days post infection in 
clone 2, clone 3 and clone 4. 

25 21 . Figure 7 shows that expression levels of p24 are maintained at high levels 

even on subsequent passages. Second pass clone 2 is diluted 10-fold and dilution was 
used to infect BHK cells. Supematants were removed at 3, 7, and 10 days post 
infection and measured via ELISA for p24 expression. 

22. Figure 8 shows Western blot analysis of clone 2. Lane 1 is empty pHSV 

30 vector, lane 2 is mock infected BHK cells, and lane 3 is BHK cells infected with clone 
2. Samples were probed with LTFOOl, anti-p24, or anti-HIV serum and developed. 
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DETAILED DESCRIPTION 

22. The present invention may be understood more readily by reference to the 
following detailed description of preferred embodiments of the invention and the 
Examples included therein and to the Figures and their previous and following 

5 description. 

23. Before the present compounds, compositions, and/or methods are disclosed 
and described, it is to be understood that ttds invention is not limited to specific 
synthetic methods or specific recombinant biotechnology methods unless otherwise 
specified, or to particular reagents unless otherwise specified, as such may, of course, 

10 vary. It is also to be xmderstood that the terminology used herein is for the purpose of 
describing particular embodiments only and is not intended to be limiting. 
Definitions 

24. As used in the specification and the appended claims, the singular forms "a," 
"an" and "the" include plural referents unless the context clearly dictates otherwise. 

15 Thus, for example, reference to "a pharmaceutical carrier" includes mixtures of two or 
more such carriers, and the like. 

25. Ranges may be expressed herein as firom "about" one particular value, 
and/or to "about" another particular value. When such a range is expressed, another 
embodiment includes firom the one particular value and/or to the other particular value. 

20 Similarly, when values are expressed as approximations, by use of the antecedent 

"about," it will be understood that the particular value forms another embodiment. It 
will be fiorther imderstood that the endpoints of each of the ranges are significant both 
in relation to the other endpoint, and independently of the other endpoint. 

26. In this specification and in the claims which follow, reference will be made 
25 to a number of terms which shall be defined to have the following meanings : 

27. "Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where 
said event or circumstance occurs and instances where it does not. 

Vectors 

30 28. Disclosed are live replicating human spximavims vectors suitable for human 

use comprising an immunizing construct, wherein the immunizing constmct is inserted 
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in the bet gene. The disclosed immunizing construct can be an antigen-encoding 
nucleic acid. 

29. Where reference is made to "antigen"-encoding nucleic acid, it is 
understood that in the context of the invention antigens encoded by the antigen- 

5 encoding nucleic acid can include but are not limited to immunogenic or non- 

immunogenic peptides, polypeptides, proteins, enzymes, cytokines. These antigens can 
be non-human exogenous antigenic sequences from viruses, bacteria, or parasites. The 
antigens can also be antigenic endogenous human or human derived sequences from a 
condition such as a cancer. Also, peptides encoded by the antigen-encoding nucleic 

10 acid can include non-antigenic sequences for the purposes of gene therapy. 

30. Also disclosed are vectors of the invention, wherein the vector is pHSV or 
pHSV-GFP. Though at any one time reference may only be made to one of the vectors, 
it is an embodiment of the invention that both vectors may be used interchangeably and 
with equivalent results. Accordingly terms referring to either vector should be 

1 5 understood to refer to both vectors. Similarly, reference throughout this disclosm-e is 
given to the pHSV vector containing an HIV antigen-encoding nucleic acid. It is 
understood that this construct may be referred to as pFOV-gag, pFOV7-gag, pFOV-7- 
gag, pFOV7-pl7/24, pFOV-7-p 17/24, pFOV-pl7/24, pFOV-7GFP-gag, pFOV-7-GFP- 
gag, pFOV-7GFP-pl7/24, pFOV-7-GFP-p 17/24, pFOV-7 HIV-gag pl7/24, pHSV-gag,: 

20 pHSV-pl7/24, pHSV-GFP-gag, pHSV-GFP-pl7/24 or pHSV-HTV-gag-p 17/24. A 
vector of the invention, can have the sequence of pHSV-HIV-GAG pi 7/24. Thus, 
herein disclosed is a pHSV vector of the invention comprising SEQ ID NO: 1, wherein 
an antigenic construct comprising the gag gene of HIV-1 (SEQ ID NO: 2) has been 
inserted into the sequence at the unique Apa I site at 12816 and the unique Not I site at 

25 13552. Said insertion resulting in the excision of nucleotides 12817-13551 of the 

vector and therefore partially replacing the bet gene of pHSV with SEQ ID NO: 2 (SEQ 
ID NO: 8). Therefore, also disclosed are vectors of the invention, wherein the aiitigen- 
encoding nucleic acid is HIV-GAG pl7/p24. Herein, pl7/24 and p41 are used 
synonymously and are intended to refer to DNA encoding Gag or an antigenic fragment 

30 of Gag. It is also herein contemplated that other genes from HIV may be used as the 
antigen-encoding nucleic acid in the vector of the invention. Such genes can include 
the env and pol genes of HTV as well as vpuy vz/and nef genes. 
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31. Also disclosed are vectors of the invention, wherein the vector has the 
sequence of pHSV, which is defined by tiie restriction map shown in Figures 1, 2, and 
3. It is an embodiment of the present invention that modifications can be made to the 
pHS V vector to incorporate additional unique restriction sites throughout the vector 
5 including but not limited to within and around the bel-2(bet), pol, gag, env, or bel-1 
genes. It is also understood that such modifications may provide additional features 
including but not limited to reporter function, resistance to a pharmaceutical agent, or 
enhancers.Related, but distinct vectors are disclosed in U.S. Patent No. 5,646,032 
which is incorporated in its entirety herein by reference. 
10 32. It is well-known in the art that vaccinations can be used prophylacticly for the 

prevention of infections as well as therapeutically for the treatment of ongoing 
conditions. Such infections or conditions can be but are not limited to viral infections. 
Thus, also disclosed are vectors of the invention, wherein the antigen-encoding nucleic 
acid is an antigen firom a virus. The viral antigen can be selected firom the group of 
1 5 viruses consisting of Herpes simplex virus type- 1 , Herpes simplex virus type-2. 
Cytomegalovirus, Epstein-Barr viras. Varicella-zoster virus. Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus, 
Hepatitis A virus. Hepatitis B virus. Hepatitis C virus. Hepatitis D vims. Hepatitis E 
vims, Rhinoviras, Coronavirus, Influenza viras A, Influenza viras B, Measles viras, 
20 Polyomavims, Hxxman Papilomaviras, Respiratory syncytial viras, Adenoviras, 

Coxsackie viras, Dengue viras, Mumps viras, Polioviras, Rabies viras, Rous sarcoma 
viras, Yellow fever viras, Ebola virus, Marburg viras, Lassa fever viras, Eastern 
Equine Encephalitis viras, Japanese Encephalitis viras, St. Louis Encephalitis viras, 
Murray Valley fever viras, West Nile viras, Rift Valley fever viras, Rotaviras A, 
25 Rotavirus B, Rotaviras C, Sindbis vuiis. Simian Immunodeficiency ciras, Human T- 

cell Leukemia viras type-1, Hantaviras, Rubella viras. Simian Immunodeficiency viras, 
Himian Immunodeficiency virus type-1, and Hxmian Immunodeficiency viras type-2, 
and Simian Immunodeficiency viras (SIV). Also disclosed are vectors of the invention, 
wherein the antigen-encoding nucleic acid is SIV-GAG. The art is repleat with 
30 examples of viral antigens whose sequences and methods of obtaining them are well 
known. Thus, a pHSV vector expressing these antigens is within the scope of the 
present invention. 

7 
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33, Vaccinations are also known for the prevention of bacterial infections. 
Additionally, anitbiotics are well-known in the art for the treatment of various bacterial 
infections. Herein contemplated and disclosed are vectors of the invention, wherein the 
antigen-encoding nucleic acid is an antigen from a bacterium. The bacterial antigen 
5 can be selected from the group consisting of M tuberculosis, M. bovis, M. bovis strain 
BCG, BCG substrains, M avium, M. intracellulare, M. afiicanum, M kansasii, M. 
marinum, M. ulcerans, M avium subspecies paratuberculosis, Nocardia asteroides, 
other Nocardia species, Legionella pneumophila, other Legionella species. Salmonella 
typhi, other Salmonella species, Shigella species. Yersinia pestis, Pasteurella 
10 haemolytica, Pasteurella multocida, other Pasteurella species, Actinobacillus 

pleuropneumonia^ Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other 
Brucella species, Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia 
trachomatis. Chlamydia psittaci, Coxiella burnetii, other Rickettsial species. Ehrlichia 
species. Staphylococcus aureus, Staphylococcus epidennidis. Streptococcus pyogenes, 
15 Streptococcus agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, 

Campylobacter species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas 
aeruginosa, other Pseudomonas species, Haemophilus influenzae, Haemophilus 
ducreyi, other Hemophilus species, Clostridium tetani, other Clostridium 
species. Yersinia enterolitica, and other Yersinia species. The art is repleat with 
20 examples of bacterial antigens whose sequences and methods of obtaining them are 

well known. Thus, a pHSV vector expressing these antigens is within the scope of the 
present invention. 

34. The vectors of the invention are not limited to bacteria and vimses. Also 
disclosed are vectors of the invention, wherein the antigen-encoding nucleic acid is an 

25 antigen from a parasite. The parasitic antigen can be selected from the group consisting 
of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, Plasmodium 
malariae, oiher Plasmodium species., Trypanosoma brucei, Trypanosoma cruzi, 
Leishmania major, oih&r Leishmania species.. Schistosoma mansoni, otiher Schistosoma 
species., and Entamoeba histolytica. The art is repleat with examples of parasitic 

30 antigens whose sequences and methods of obtaining them are well known. Thus, a 
pHSV vector expressing these antigens is within the scope of the present invention. 
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35. The treatment of various forms of cancer is a major concern for millions of 
people worldwide and the focus of much of medical research. Herein contemplated ate 
methods of treating a cancer comprising administering to a subject the vector of the 
invention. Therefore, also disclosed are vectors of the invention, wherein the antigen- 

5 encoding nucleic acid is a tumor antigen. The tumor antigen can be selected from the 
list consisting of human epithelial cell mucin (Muc-1; a 20 amino acid core repeat for 
Muc-1 glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha- 
ras oncogene product, p53, carcino-embryonic antigen (CEA), the raf oncogene 
product, gpl00/pmell7, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, 

10 GAGE, tyrosinase, gp75, Melan-A/Mart-l, gplOO, HER2/neu, EBV-LMP 1 & 2, HPV- 
F4, 6, 7, prostate-specific antigen (PSA), HPV-16, MUM, alpha-fetoprotein (AFP), 
C017-1 A, GA733, gp72, p53, the ras oncogene product, HPV El, Wihn's tumor 
antigen-1, telomerase, and melanoma gangliosides. Each of these antigens is known 
and has a coding, sequence that is publically available and well-known in the art. 

1 5 Further cancer antigens, whether later discovered or presently know can be expi«ssed ' ' --l 
by the present"pHSV vector as described herein. 

36. There are instances wherein it is advantageous to administer the vector of 
the invention in a pharmaceutical composition that comprises other vaccines. 
Pharamceutical compositions comprising multiple vaccines can be for therapeutic or t 

20 prophylactic purposes. An example Examples of such a composition is other vaccines 
include the mumps, measles, rubella (MMR) vaccine, and vaccines against M. 
tuberculosis, M. bovis, M. bovis sti-ain BCG, BCG substrains, M avium, M. 
intracellulare, M. africanum, M. kansasii. M. marinum, M. ulcerans. M. avium 
subspecies paratuberculosis. Nocardia asteroides, other Nocardia species, Legionella 

25 pneumophila, other Legionella species. Salmonella typhi, other Salmonella species, 

Shigella species. Yersinia pestis. Pasteurella haemolytica, Pasteurella multocida, other 
Pasteurella species, Actinobacillus pleuropneumoniae. Listeria monocytogenes. 
Listeria ivanovii. Brucella abortus, ottier Brucella species, Cowdria ruminantium. 
Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia psittaci, Coxiella burnetii, 

30 other Rickettsial species. Ehrlichia species. Staphylococcus aureus. Staphylococcus 
epidermidis. Streptococcus pyogenes. Streptococcus agalactiae. Bacillus anthracis. 
Escherichia coli. Vibrio cholerae, Campylobacter species, Neiserria meningitidis. 
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Neiserria gonorrhea, Pseudomonas aeruginosa, other Pseudomonas species, 
Haemophilus influenzae, Haemophilus ducreyi, other Hemophilus species, Clostridium 
tetani, other Clostridium species,Yersinia enterolitica, and other Yersinia species . 
Specifically contemplated and disclosed are pharmaceutical compositions comprising 
5 the vector of the invention and one or more additional vaccines. Also disclosed and 
herein contemplated are instances in which the vector of the invention comprises more 
than one antigen-encoding nucleic acid. In such a situation the vector of the invention 
will produce each antigen encoded in the vector as a separate antigen. 

37. There are instances in which a vector of the invention alone may not be 

10 suitable for a given purpose (eg., A kit designed to screen potential drugs for the 
treatment of a condition such kit, intended for use in laboratories without the 
capabilities to transfect a cell-line with the vector). In such cases, cells previously 
transfected with the vector of the invention are needed. Thus, also disclosed are cells 
comprising the vector of the invention. 

15 38. In an embodiment of the invention the antigen-encoding nucleic acid can 

encode a non-antigenic sequence of DNA. This sequence can provide a functional 
copy of a disrupted, hiutated, disregulated or deleted gene. Examples of nucleic acids : 
encoding proteins that play a role in genetic disorders are known in the literature 
relating to genetic disorders which is incorporated herein by reference. Methods of 

20 making these cells are described and exemplified herein and in the art. 

39. The ability to detect the presence of a construct can be a desireable featm*e 
of any vector. As such, vectors are often contain a marker to show that the construct of 
interest has been delivered to the subject (eg. a cell), and once delivered, is being 
expressed. A marker can take the form of a gene that is detectable when expressed. 

25 Thus, also disclosed are vectors of the invention fiarther comprising a reporter gene. 
One example of a reporter gene is green fluorescence protein (GFP). 
Compositions 

40. The invention includes a composition comprising a vector of the invention. 
Disclosed are the components to be used to prepare the disclosed compositions as well 

30 as the compositions themselves to be used within the methods disclosed herein. These 
and other materials are disclosed herein, and it is understood that when combinations, 
subsets, interactions, groups, etc. of these materials are disclosed tliat while specific 
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reference of each various individual and collective combinations and permutation of 
these compounds may not be explicitly disclosed, each is specijBcally contemplated and 
described herein. For example, if a particular spumavirus vector or antigen-encoding 
nucleic acid (e.g., pl7, p24, pl7/24, gag, gpl20) is disclosed and discussed and a 

5 mmaber of modifications that can be made to a nimiber of molecules including the 
spumavirus vector or antigen-encoding nucleic acid (e.g., pi 7, p24, pi 7/24, gag, 
gpl20) are discussed, specifically contemplated is each and every combination and 
permutation of spumavirus vector and antigen-encoding nucleic acid (e.g., pi 7, p24, 
pi 7/24, gag, gpl20) and the modifications that are possible imless specifically 

10 indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as 

well as a class of molecules D, E, and F and an example of a combination molecule, A- 
D is disclosed, then even if each is not individually recited each is individually and 
collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, 
and C-F are considered disclosed. Likewise, any subset or combination of these is also 

15 disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered 
disclosed. This concept applies to all aspects of this application including, but not 
limited to, steps in methods of making and using the disclosed compositions. Thus, if 
there are a variety of additional steps that can be performed it is understood that each of 
these additional steps can be performed with any specific embodiment or combination 

20 of embodiments of the disclosed methods. Often therapeutic agents and vaccines are 
administered in formulations or combinations that incorporate other therapeutic 
modalities or necessary components for the purposes of time release, delivery, or 
augmentation of a response. It is understood and herein contemplated that the disclosed 
vectors and exogenous nucleic acid of the invention can be combined and administered 

25 with any such modality or component. 
Expression systems 

41. The nucleic acid vectors of the invention that are delivered to cells typically 
contain expression controlling systems for controlling the expression of 
heterologous/exogenous nucleic acid. For example, the inserted genes in viral and 

30 retroviral systems usually contain promoters, and/or enhancers to help control the 
expression of the desired gene product. A promoter is generally a sequence or 
sequences of DNA that function when in a relatively fixed location in regard to the 

11 
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transcription start site. A promoter contains core elements required for basic 
interaction of RNA polymerase and transcription factors, and may contain upstream 
elements and response elements. 

Viral Promoters and Enhancers 

5 42. Preferred promoters controlling transcription from vectors in mammalian 

host cells may be obtained from various sources, for example, the genomes of viruses 
such as: polyoma. Simian Virus 40 (SV40), adenovirus, retrovirases, hepatitis-B virus, 
cytomegalovirus and most preferably spumavirus, or from heterologous mammalian 
promoters, e.g. beta actin promoter. Each of these promoters in known and has a 

10 sequence that is publicly available. The early and late promoters of the SV40 virus are 
conveniently obtained as an SV40 restriction fragment which also contains the SV40 
viral origin of replication (Fiers et al.. Nature. 273: 113 (1978)). The immediate early 
promoter of the human cytomegalovirus is conveniently obtained as a HindmB 
restriction fragment (Greenway, P.J. et al.. Gene 18: 355-360(1982)). Of course, 

15 promoters from tiie host cell or related species also are usefril herein. Methods of 
ftinctionally linking promoters with coding sequences are well-known. 

43. Enhancer generally refers to a sequence of DNA that ftmctions at no fixed 
distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. 
Natl. Acad. Sci. 78: 993 (1981)) or 3' (Lusky, M.L., et al., Mol. Cell Bio. 3: 1 108 

20 (1983)) to the transcription unit. Furthermore, enhancers can be within an intron 

(Banerji, J.L. et al., Cell 33: 729 (1983)) as well as within the coding sequence itself 
(Osborne, T.F.. et al., Mol. Cell Bio. 4: 1293(1984)). They are usually between 10 
and 300 bp in length, and they ftmction in cis. Enhancers ftinction to increase 
transcription from nearby promoters. Enhancers also often contain response elements 

25 that mediate the regulation of transcription. Promoters can also contain response 

elements that mediate the regulation of transcription. Enhancers often determine the 
regulation of expression of a gene. While many enhancer sequences are now known 
from mammalian genes (globin, elastase, albumin, -fetoprotein and insiilin), typically 
one will use an enhancer from a eukaryotic cell virus for general expression. Preferred 

30 examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), 
the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of 
the replication origin, and adenovirus enhancers. 

12 
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44. The promotor and/or enhancer may be specifically activated either by light 
or specific chemical events which trigger their function. Systems can be regulated by 
reagents such as tetracycline and dexamethasone. There are also ways to enhance viral 
vector gene expression by exposiire to irradiation, such as gamma irradiation, or 

5 alkylating chemotherapy drugs. 

45. In certain embodiments the promoter and/or enhancer region can act as a 
constitutive promoter and/or enhancer to maximize expression of the region of the 
transcription unit to be transcribed. In certain constructs the promoter and/or enhancer 
region be active in all eukaryotic cell types, even if it is only expressed in a particular 

10 type of cell at a particular time. A preferred promoter of this type is the CMV promoter 
(650 bases). Other preferred promoters are SV40 promoters, cytomegalovirus (full 
lengdi promoter), and retroviral vector LTR. 

46. It has been shown that all specific regulatory elements can be cloned and 
used to construct expression vectors that are selectively expressed in specific cell types 

15 such as melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been 
used to selectively express genes in cells of glial origin. 

47. Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human or nucleated cells) may also contain sequences necessary for the 
termination of transcription which may affect mRNA expression. These regions are 

20 transcribed as polyadenylated segments in the imtranslated portion of the mRNA 

encoding tissue factor protein. The 3' untranslated regions also include transcription 
termination sites. It is preferred that the transcription imit also contain a 
polyadenylation region. One benefit of this region is that it increases the likelihood that 
the transcribed unit will be processed and transported like mRNA. The identification 

25 and use of polyadenylation signals in e?q)ression constmcts is well established. It is 
preferred that homologous polyadenylation signals be used in the transgene constructs. 
In certain transcription units, the polyadenylation region is derived firom the SV40 early 
polyadenylation signal and consists of about 400 bases. It is also preferred that tiae 
transcribed imits contain other standard sequences alone or in combination with the 

30 above sequences improve expression fi-om, or stability of, the construct. 
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Markers 

48. The viral vectors can include nucleic acid sequence encoding a marker 
product (reporter gene). This marker product is used to determine if the gene has been 
delivered to Ihe cell and once delivered, is being expressed. Marker genes can be but 

5 are not limited to the E, Coli lacZ gene, which encodes fi-galactosidase, and green 
fluorescent protein. 

49. In some embodiments the marker may be a selectable marker. Examples of 
suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), 
thymidine kinase, neomycin, neomycin analog G418, hydromycin, and pxaromycin. 

10 When such selectable markers are successfully transferred into a mammalian host cell, 
the transformed mammahan host cell can survive if placed under selective pressure. 
There are two widely used distinct categories of selective regimes. The first category is 
based on a cell's metabolism and the use of a mutant cell line which lacks the ability to 
grow independent of a supplemented media. Two examples are: CHODHFR- cells 

IS and mouse LTK- cells. These cells lack the ability to grow without the addition of such 
nutrients as thymidine or hypoxanthine. Because these cells lack certain genes 
necessary for a complete nucleotide syntiiesis pathway, they cannot survive unless the 
missing nucleotides are provided in a supplemented media. An alternative to 
supplementing the media is to introduce an intact DHFR or TK gene into cells lacking 

20 the respective genes, thus altering their growth requirements. Individual cells which 
were not transformed with the DHFR or TK gene will not be capable of survival in 
non-supplemented media. 

50. The second category is dominant selection which refers to a selection 
scheme used in any cell type and does not require the use of a mutant cell line. These 

25 schemes typically use a dmg to arrest growth of a host cell. Those cells which have a 
novel gene woxild express a protein conveying drug resistance and would survive the 
selection. Examples of such dominant selection use the dmgs neomycin, (Southern P. 
and Berg, P. . J. Molec. ApdL Genet. 1: 327 (1982)), mycophenolic acid, (Mulligan, 
R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., Mol. 

30 Cell. Biol. 5: 410-413 (1985)). The three examples employ bacterial genes under 
eukaryotic control to convey resistance to the appropriate drug G418 or neomycin 
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(geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the 
neomycin analog G4 1 8 and puramycin. 
Treatment and Prevention Methods 

51. By "treating" is meant an improvement in the disease state (i.e., viral 
5 infection, bacterial infection, parasitic infection, cancer, genetic disorder, or 

autoimmune disease) is observed and/or detected upon administration of a substance of 
the present invention to a subject. Treatment can range from a positive change in a 
symptom or symptoms of the disease to complete amelioration of the viral infection, 
bacterial infection, parasitic infection, or cancer (e.g., reduction in severity or intensity 

10 of disease, alteration of clinical parameters indicative of the subject's condition, relief 
of discomfort or increased or enhanced function), as detected by art-known techniques. 
The methods of the present invention can be utilized to treat an established viral 
infection, bacterial infection, parasitic infection, or cancer. One of skill in the art woxild 
recognize that viral infection, bacterial infection, parasitic infection, or cancer refer to 

15 conditions characterized by the presence of a foreign pathogen or abnormal cell growth. > 
Clinical symptoms will depend on the particular condition and are easily recognizable 
by those skilled in the art of treating the specific condition. Treatment methods can 
include, but are not limited to therapeutic vaccinations. Thus, Disclosed are methods of 
treating a subject with a condition comprising administering to the vector of the 

20 invention. 

52. Also disclosed are methods of the invention, wherein the condition being 
treated is a viral infection. The viral infection can be selected from the list of viruses 
consisting of Herpes simplex virus type-1. Herpes simplex virus type-2. 
Cytomegalovirus, Epstein-Barr virus. Varicella-zoster virus. Human herpesviras 6, 

25 Human herpesvims 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus. Hepatitis C virus. Hepatitis D vims. Hepatitis E 
virus, Rhinovirus, Coronavims, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavims, Human Papilomaviras, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus. Dengue vims. Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 

30 virus. Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis vims, Japanese Encephalitis vims, St. Louis Encephalitis virus, 
Murray Valley fever vims. West Nile virus. Rift Valley fever vims. Rotavirus A, 

15 
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Rotavirus B, Rotavirus C, Sindbis virus, Simian Immunodeficiency cirus. Human T- 
cell Leukemia virus type-1, Hantavirus, Rubella virus. Simian Immunodeficiency virus, 
Human Immimodeficiency virus type-1, and Human Incmiunodeficiency virus type-2. 

53. Also disclosed are methods of the invention, wherein the antigen-encoding 
S nucleic acid is an antigen from a virus. The viral antigen can be selected from the 

group of viruses consisting of Herpes simplex virus type-1, Herpes simplex virus type- 
2, Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus, Himian herpesvirus 6, 
Human herpesvirus 7, Human herpesvirus 8, Variola virus, Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus, Hepatitis C virus. Hepatitis D virus, Hepatitis E 

10 virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Hmnan Papilomavirus, Respiratory syncytial virus. Adenovirus, 
Coxsackie virus, Dengue virus. Mumps virus, Poliovirus, Rabies virus, Rous sarcoma 
virus. Yellow fever vims, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 

1 5 Murray Valley fever virus. West Nile virus. Rift Valley fever virus. Rotavirus A, 

Rotavirus B, Rotavims C, Sindbis virus. Simian Immimodeficiency cirus. Human T- 
cell Leukemia virus type-1. Hantavirus, Rubella virus. Simian Immimodeficiency vims. 
Human Immimodeficiency vims type-1, and Human Immunodeficiency vims type-2. 

54. Also disclosed are methods of the invention, wherein the condition being 
20 treated is a bacterial infection. The bacterial infection can be selected from the list of 

bacterium consisting of M tuberculosis, M, bovis, M bovis strain BCG, BCG 
substrains, M avium^ M. intracellulare, M. africanum, M kansasii, M. marinum, M 
ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroides^ other Nocardia 
species, Legionella pneumophila^ other Legionella species. Salmonella typhi, other 

25 Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 

Pasteurella multocida, o^et Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, oihcT Brucella species, 
Cowdria ruminantium^ Chlamydia pneumoniae^ Chlamydia trachomatis^ Chlamydia 
psittaciy Coxiella bumetti^ oihex Rickettsial species. Ehrlichia species. Staphylococcus 

30 aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 

agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 

16 
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Pseudomonas species, Haemophilus influenzae^ Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia 
enterolitica, and other Yersinia species. 

55. Also disclosed are methods of the invention, wh^ein the antigen-encoding 
5 nucleic acid is an antigen from a bacterium. The bacterial antigen can be selected from 

the group consisting of M tuberculosis, M. bovis, M. bovis strain BCG, BCG 
substrains, M avium^ M intracellulare, M africanum, M. kansasii, M marinum, M, 
ulcerans, M avium subspecies paratuberculosis, Nocardia asteroides^ other Nocardia 
species, Legionella pneumophila^ otixor Legionella species, Salmonella typhi, other 

10 Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 

Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae, 
Listeria monocytogenes, Listeria ivanovii, Brucella abortus, other Brucella species, 
Cowdria ruminantium^ Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaci, Coxiella bumetti, other Rickettsial species. Ehrlichia species. Staphylococcus 

15 aureus. Staphylococcus epidermidis. Streptococcus pyogenes. Streptococcus 

agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium spQcies,Yersinia enterolitica, 

20 and other Yersinia species, 

56. Also disclosed are methods of the invention, wherein the condition being 
treated is a parasitic infection. The parasitic infection can be selected from the list of 
parasites consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, 
Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, Trypanosoma 

25 cruzi, Leishmania major, oih&t Leishmania species.. Schistosoma mansoni, other 
Schistosoma species., and Entamoeba histolytica, 

57. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a parasite. The parasitic antigen can be selected from 
the group consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 

30 vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
Trypanosoma cruzi, Leishmania major, other Leishmania species., Schistosoma 
mansoni, oih&x Schistosoma species., and Entamoeba histolytica. 
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58. Also disclosed are methods of the invention, wherein the condition being 
treated is cancer. 

59. The disclosed vectors and vector containing compositions can be used to 
treat any disease where uncontrolled cellular proliferation occurs such as cancers. A 
non-linaiting list of different types of cancers is as follows: lymphomas (Hodgkins and 
non-Hodgkins), leukemias, carcinomas, carcinomas of solid tissues, squamous cell 
carcinomas, adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastomas, 
neuroblastomas, plasmacytomas, histiocytomas, melanomas, adenomas, hj^oxic 
tumours, myelomas, AIDS-related lymphomas or sarcomas, metastatic cancers, or 
cancers in general. 

60. A representative but non-limiting list of cancers that the disclosed 
compositions can be used to treat is the following: lymphoma, B cell lymphoma, T cell 
lymphoma, mycosis fimgoides, HodgJdn*s Disease, myeloid leukemia, bladder cancer, 
brain cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma 
of head and neck, kidney cancer, lung cancers such as small cell lung cancer and non- 
small cell Ixmg cancer, neuroblastoma/gUoblastoma, ovarian cancer, pancreatic cancer, 
prostate cancer, skin cancer, liver cancer, melanoma, squamous ceU carcinomas of the 
mouth, throat, larynx, and lung, colon cancer, cervical cancer, cervical carcinoma, 
breast cancer, and epithelial cancer, renal cancer, genitourinary cancer, pulmonary 
cancer, esophageal carcinoma, head and neck carcinoma, large bowel cancer, 
hematopoietic cancers; testicular cancer; colon and rectal cancers, prostatic cancer, or 
pancreatic cancer. 

61. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is a tumor antigen. The tumor antigen can be selected from the list 
consisting of human epithelial cell mucin (Muc-1; a 20 amino acid core repeat for Muc- 
1 glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha-ras 
oncogene product, p53, carcino-embryonic antigen (CEA), the raf oncogene product, 
gpl00/pmell7, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, 
tyrosinase, gp75, Melan-A/Mart-1, gplOO, HER2/neu, EBV-LMP 1 & 2, HPV-F4, 6, 7, 
prostate-specific antigen (PSA), HPV-16, MUM, alpha-fetoprotein (AFP), C017-1 A, 
GA733, gp72, p53, the ras oncogene product, HPV E7, Wilm's tumor antigen-1, 
telomerase, and melanoma gangliosides. 
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62. Disclosed are methods of treating a condition in a subject comprising 
administering to the subject tiie vector of the invention, wherein tiie condition is due to 
a mutated, disregulated, disrupted, or deleted gene; autoimmimity; or inflammatory 
diseases. 

S 63. Disclosed are methods of treating a condition in a subject, wherein the 

condition can be selected from list consisting of cystic fibrosis, asthma, multiple 
sclerosis, muscular dystrophy, diabetes, tay-sachs, spinobifida, cerebral palsy, 
Parkinson's disease, lou gehrigg disease, alzheimer's, systemic lupus erythamatosis, 
hemophelia, Addsion's disease, Cushing's disease. 

10 64. By "preventing" is meant that after administration of a substance of the 

present invention to a subject, the subject does not develop the symptoms of the viral, 
bacterial, or parasitic infection, and/or does not develop the viral, bacterial, or parasitic 
infection. "Preventing" or "prevention" can also refer to the ultimate reduction of an 
infection, condition, or symptoms of an infection, or condition relative to infections or 

15 conditions in subjects that do not receive the substance. Methods of prevention can 
include, but are not limited to profilactic vaccination. As such, disclosed are methods 
of preventing an infection in a subject comprising administering to the subject the 
vector of the invention. 

65. Also "disclosed are methods of the invention, wherein the infection 

20 prevented is a viral infection. The viral infection can be selected from the list of 
viruses consisting of Herpes simplex virus type-1. Herpes simplex virus type-2, 
Cytomegalovirus, Epstein-Barr virus, Varicella-zoster virus. Human herpesvirus 6, 
Himian herpesvirus 7, Human herpesvirus 8, Variola virus, Vesicular stomatitis virus. 
Hepatitis A virus, Hepatitis B virus. Hepatitis C virus. Hepatitis D virus. Hepatitis E 

25 virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles virus, 
Polyomavirus, Human Papilomavirus, Respiratory syncj^ial virus. Adenovirus, 
Coxsackie vims. Dengue virus, Miraips virus, Poliovirus, Rabies virus, Rous sarcoma 
virus. Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus. Eastern 
Equine Encephalitis virus, Japanese Encephalitis vims, St. Louis Encephalitis virus, 

30 Murray Valley fever virus. West Nile vims. Rift Valley fever virus, Rotavims A, 

Rotavirus B, Rotavirus C, Sindbis vims. Simian Immunodeficiency cirus, Himian T- 
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cell Leukemia virus type-1. Hantavirus, Rubella virus, Simian Immunodeficiency virus, 
Htmian Immimodeficiency viras type-1, and Human Immunodeficiency vims type-2. 

66. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen firom a virus. The viral antigen can be selected fi:om the 

5 group consisting of Herpes simplex virus type-1. Herpes simplex virus type-2, 

Cytomegalovirus, Epstein-Barr virus. Varicella-zoster virus. Human herpesvirus 6, 
Human herpesvirus 7, Human herpesvims 8, Variola vims, Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E 
vims, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles vims, 

10 Polyomavirus, Human Papilomavirus, Respiratory syncytial vims, Adenovims, 

Coxsackie vims. Dengue vims, Miunps vims, Poliovims, Rabies vims, Rous sarcoma 
vims. Yellow fever virus, Ebola vims, Marburg vims, Lassa fever vims, Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St. Louis Encephalitis virus, 
Murray Valley fever virus. West Nile virus. Rift Valley fever virus, Rotavirus A,. 

1 5 Rotaviras B, Rotavirus C, Sindbis vims. Simian Immxmodeficiency cims. Human T- 
cell Leukemia vims type-1, Hantavims, Rubella vims. Simian Immunodeficiency vims. 
Human Immimodeficiency virus type-1, and Human Immunodeficiency vims type-2. 

67. Also disclosed are methods of the invention, wherein the infection 
prevented is a bacterial infection. The bacterial infection can be selected from the list 

20 of bacterium consisting of M tuberculosis, M bovis, M bovis strain BCG, BCG 

substrains, M avium, M. intracellulare, M qfricanum, M kansasii, M marinum, M 
ulcerans, M avium subspecies paratuberculosis, Nocardia asteroides, other Nocardia 
species, Legionella pneumophila^ other Legionella species. Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 

25 Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, ofh&r Brucella species, 
Cowdria ruminantium^ Chlamydia pneumoniae^ Chlamydia trachomatis^ Chlamydia 
psittaciy Coxiella bumetti^ other Rickettsial species. Ehrlichia species, Staphylococcus 
aureus. Staphylococcus epidermidis, Streptococcus pyogenes. Streptococcus 

30 agalactiae. Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 

species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
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Hemophilus species, Clostridium tetanU other Clostridium species. Yersinia 
enterolitica, and other Yersinia species. 

68. Also disclosed are methods of the invention, wherein tiie antigen-encoding 
nucleic acid is an antigen from a bacterium. The bacterial antigen can be selected from 
5 the group consisting of M tuberculosis, M bovis, M bovis strain BCG, BCG 

substrams, M. avium, M intracellulare, M, africanum, M kansasii, M. marinum, M, 
ulcerans, M. avium subspecies paratuberculosis, Nocardia asteroides, oih&s Nocardia 
species, Legionella pneumophila, oihsx Legionella species, Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 
10 Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumonia^ 
Listeria monocytogenes, Listeria ivanovii. Brucella abortus, other Brucella species, 
Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaciy Coxiella bumetti, other Rickettsial species. Ehrlichia species. Staphylococcus 
aureus, Staphylococcus epidermidis, Str-eptococcus pyogenes. Streptococcus 
15 agalactiae, Bacillus anthracis, Escherichia coli. Vibrio cholerae, Campylobacter 

species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa,, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species, 7er.sinza enterolitica, 
and other Yersinia species. 
20 69. Also disclosed are methods of the invention, wherein the infection 

prevented is a parasitic infection. The parasitic infection can be selected from the list 
of parasites consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
Trypanosoma cruzi, Leishmania major, oHiCT Leishmania species.. Schistosoma 
25 mansoni, other Schistosoma species., and Entamoeba histolytica. 

70. Also disclosed are methods of the invention, wherein the antigen-encoding 
nucleic acid is an antigen from a parasite. The parasitic antigen can be selected from 
the group consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei, 
30 Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 
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71. Also disclosed are methods of the inventioii, wherein the subject is a horse, 
cow, pig, dog, car, mouse, monkey, human, or a cell isolated from such an animal. 
Deliveiy of the compositions to cells 

72. There are a number of compositions and methods which can be used to 

5 deliver nucleic acids to cells, either in vitro or in vivo. For example, the nucleic acids 
can be delivered through a nimiber of direct delivery systems such as, electroporation, 
lipofection, calcium phosphate precipitation, and the present viral vectors. Appropriate 
means for transfection, including the present viral vectors, chemical transfectants, or 
physico-mechanical methods such as electroporation and direct diffusion of DNA, are 

10 described by, for example, Wolff, J. A., et al.. Science, 247, 1465-1468, (1990); and 

Wolff, J. A. Nature, 352, 815-818, (1991), Such methods are well known in the art and 
readily adaptable for use with the compositions and methods described herein. In 
certain cases, the methods will be modified to specifically function with large DNA 
molecules. Further, these methods can be used to target certain diseases and cell 

1 5 populations by using the targeting characteristics of the carrier. 

73. Transfer vectors can be any nucleotide constmction used to deliver genes 
into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part 
of recombinant retrovims or adenovims (Ram et al. Cancer Res. 53:83-88, (1993)). 

74. As used herein, viral vectors such as pHSV are agents that transport the 
20 disclosed antigen-encoding nucleic acids, such as pi 7/24 into the cell without 

degradation and include a promoter yielding expression of the gene in the cells into 
which it is delivered. Retroviral vectors especially spumavirus vectors are able to carry 
a larger genetic pay load, i.e., a transgene or marker gene, than other viral vectors. A 
preferred embodiment is a viral vector which has been engineered so as to induce the 
25 immune response of the host organism, elicited by the peptides encoded on the vector. 

75. Viral vectors can have higher transaction abilities (ability to introduce 
genes) than chemical or physical methods to introduce genes into cells. Typically, viral 
vectors contain, nonstructural and structural genes, a polymerase, inverted terminal 
repeats necessary for replication and encapsidation, and promoters to control the 

30 transcription and replication of the viral genome. When engineered as vectors, vimses 
typically have one or more of the early genes removed and a gene or gene/promotor 
cassette is inserted into the viral genome in place of the removed viral DNA. 

22 



wo 2004/003153 



tT/US2003/020325 



Constructs of this type can carry large fragments of foreign genetic material. The 
necessary functions of the removed early genes are typically supplied by cell lines 
which have been engineered to express the gene products of the early genes in trans. 

76. Spumaviruses are retroviruses. A retrovirus is an animal virus belonging to 

5 the virus femily of Retroviridae, including any types, subfamilies, genus, or tropisms. 
Retroviral vectors, in general, are described by Verma, I.M., Retroviral vectors for gene 
transfer. In Microbiology- 1985, American Society for Microbiology, pp. 229-232, 
Washington, (1985), which is incorporated by reference herein. Examples of methods 
for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,1 16 

10 and 4,980,286; PCX applications WO 90/02806 and WO 89/07136; and Mulligan, 
(Science 260:926-932 (1993)); the teachings of which are incorporated herein by 
reference. Although the present spumavirus vector is unique, the methods described for 
using other types of viral vectors can be useful in certain contexts. See for example 
U.S. Patent No. 5,646,032, which is incorporated herein for its teaching of those 

15 methods. 

77. A retrovirus is essentially a package which has packed into it nucleic acid 
cargo. The nucleic acid cargo carries with it a packaging signal, which ensures that the 
replicated daughter molecules will be efELciently packaged within the package coat. In 
addition to the package signal, there are a number of molecules which are needed in cis, 

20 for the replication, and packaging of the replicated virus. Typically a retroviral 

genome, contains the gag, pol, and env genes which are involved in the making of the 
protein coat. It is the gag, pol, and env genes which are t>T)ically replaced by the 
foreign DNA that is to be transferred to the target cell. Retrovirus vectors typically 
contain a packaging signal for incorporation into the package coat, a sequence which 

25 signals the start of the gag transcription unit, elements necessary for reverse 

transcription, including a primer binding site to bind the tRNA primer of reverse 
transcription, terminal repeat sequences that guide the switch of RNA strands during 
DNA synthesis, a purine rich sequence 5' to the 3' LTR that serves as the priming site 
for the synthesis of the second strand of DNA synthesis, and specific sequences near 

30 the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to 
insert into the host genome. The removal of the gag, pol, and env genes allows for 
large firagments of foreign sequence to be inserted into the viral genome, become 
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reverse transcribed, and upon replication be packaged into a new retroviral particle. 
This amount of nucleic acid is sufficient for the delivery of a one to n:iany genes 
dependiag on the size of each transcript. It is preferable to include either positive or 
negative selectable markers along with other genes in tiie insert. 

5 78. A packaging cell line is a cell line which has been transfected or 

transformed witii a retrovirus that contains the replication and packaging machinery, 
but lacks any packaging signal. When the vector carrying the DNA of choice is 
transfected into these cell lines, the vector containing the gene of interest is replicated 
and packaged into new retroviral particles, by the machinery provided in cis by the 

10 helper cell. The genomes for the machinery are not packaged because they lack the 
necessary signals. 

In vivo/ex vivo 

79. As described herein, the vector-containing compositions can be 
administered in a pharmaceutically acceptable carrier and can be delivered to the 

15 subjectDs cells in vivo and/or ex vivo by a variety of mechanisms well known in the art 
(e.g., uptake of naked DNA, liposome fusion, intramuscular injection of DNA via a 
gene gun, endocytosis and tiie like). 

80. If ex vivo methods are employed, cells or tissues can be removed and 
maintained outside the body according to standard protocols well known in the art. The 

20 compositions can be introduced into the cells via any gene transfer mechanism, such as, 
for example, calcium phosphate mediated gene delivery, electroporation, 
microinjection or proteoliposomes. The transduced cells can then be infused (e.g., in a 
pharmaceutically acceptable carrier) or homotopically transplanted back into the 
subject per standard methods for the cell or tissue type. Standard methods are known 

25 for transplantation or infusion of various cells into a subject. 
Nucleic Acid Delivery 

81. In the methods described above which include the administration and uptake 
of exogenous DNA into the cells of a subject (i.e., gene transduction or transfection), 
the nucleic acids of the present invention can be in the form of naked DNA or RNA, or 

30 the nucleic acids can be in a vector for delivering the nucleic acids to the cells, whereby 
the antibody-encoding DNA firagment is under the transcriptional regulation of a 
promoter, as would be well imderstood by one of ordinary skill in the art. 
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82. As one example, if the antigen-encoding nucleic acid of the invention is 
delivered to the cells of a subject in a spumavirus vector, the dosage for administration 
of spumavirus to humans can range from about 10^ to 10^ plaque forming units (pfu) 
per injection but can be as high as lO'^ to 10^^ pfu per injection. In some cases lower 

5 dosages (eg., 10^, 10"*, 10^, and 10^ pfu) can be effective. A subject can receive a single 
injection, or, if additional injections are necessary, they can be repeated at two, four, six 
month intervals (any intervening time intervals or other appropriate time intervals, as 
determined by the skilled practitioner) for an indefinite period and/or imtil the efficacy 
of the treatment has been established. 

10 83. Parenteral administration of the nucleic acid or vector of the present 

' invention, if used, is generally characterized by injection. Injectables can be prepared 
* in conventional forms, either as liquid solutions or suspensions, solid forms suitable for 
I solution of suspension in liquid prior to injection, or as emulsions. A more recently 
revised approach for parenteral administration involves use of a slow release or 

15 sustained release system such that a constant dosage is maintained. See, e.g., U.S. 
Patent No. 3,610,795, which is incorporated by reference herein. For additional 
discussion of suitable formulations and various routes of administration of therapeutic 
compounds, see, e.g.. Remington: The Science and Practice of Pharmacy (19th ed.) ed. 
A.R. Gennaro, Mack Publishing Company, Easton, PA 1995. 

20 Delivery of pharmaceutical products 

84. The vector or vector-containing compositions can be administered in vivo in 
a pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is meant a 
material that is not biologically or otherwise undesirable, i.e., the material may be 
administered to a subject, along with the nucleic acid or vector, without causing any 

25 undesirable biological effects or interacting in a deleterious manner with any of the 
other components of the pharmaceutical composition in which it is contained. The 
carrier would naturally be selected to minimize any degradation of the active ingredient 
and to minimize any adverse side effects in the subject, as would be well known to one 
of skill in the art. 

30 85. The compositions may be administered orally, parenterally (e.g., 

intravenously), by intramuscular injection, by intraperitoneal injection, subcutaneously, 
transdermally, extracorporeally, topically, gene gun or the like, although topical 

25 



wo 2004/003153 ^PCT/US2003/020325 



intranasal administration or administration by inhalant is typically preferred. As used 
herein, "topical intranasal administration" means delivery of the compositions into the 
nose and nasal passages through one or both of the nares and can comprise delivery by 
a spraying mechanism or droplet mechanism, or through aerosolization of the nucleic 

5 acid or vector. The latter may be effective when a large nimiber of animals is to be 
treated simultaneously. Administration of the compositions by inhalant can be through 
the nose or mouth via delivery by a spraying or droplet mechanism. Delivery can also 
be directly to any area of the respiratory system (e.g., lungs) via intubation. The exact 
amoxmt of the compositions required will vary from subject to subject, depending on 

10 the species, age, weight and general condition of the subject, the severity of the allergic 
disorder being treated, the particular nucleic acid or vector used, its mode of 
administration and the like. Thus, it is not possible to specify an exact amount for 
every composition. However, an appropriate amount can be determined by one of 
ordinary skill in the art using only routine experimentation given the teachings herein, 

15 86. The materials may be in solution, suspension (for example, incorporated - 

into microparticles, liposomes, or cells). These may be targeted to a particular cell type 
via antibodies, receptors, or receptor ligands. The ifoUowing references are examples of 
the use of this technology to' target specific proteins to tumor tissue (Senter, et al., 
Bioconiugate Chem., 2:447-451, (1991); Bagshawe, K.D., Br, J. Cancer, 60:275-281, 

20 (1989); Bagshawe, et al, Br. J. Cancer , 58:700-703, (1988); Senter, et al, Bioconiugate 
Cheiru, 4:3-9, (1993); Battelli, et al.. Cancer Immunol. Immunother.> 35:421-425, 
(1992); Pietersz and McKenzie, Immunolog. Reviews, 129:57-80, (1992); and Roffler, 
et al., Biochem. Pharmacol. 42:2062-2065, (1991)). Vehicles such as "stealth" and 
other antibody conjugated liposomes (including lipid mediated drug targeting to colonic 

25 carcinoma), receptor mediated targeting of DNA through cell specific ligands, 

lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting 
of murine glioma cells in vivo. The following references are examples of Ihe use of this 
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research. 
49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophvsica Acta. 

30 1 104: 179-187, (1992)). In general, receptors are involved in pathways of endocytosis, 
either constitutive or ligand induced. These receptors cluster in clathrin-coated pits, 
enter the cell via clathrin-coated vesicles, pass through an acidified endosome in which 
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the receptors are sorted, and then either recycle to the cell surface, become stored 
iatracellularly, or are degraded in lysosomes. The internalization pathways serve a 
variety of functions, such as nutrient uptake, removal of activated proteins, clearance of 
macromolecules, opportunistic entry of viruses and toxins, dissociation and degradation 

5 of ligand, and receptor-level regulation. Many receptors follow more than one 
intracellular pathway, depending on the cell type, receptor concentration, type of 
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms 
of receptor-mediated endocytosis has been reviewed (Brown and Greene, DN A and 
Cell Biology 10:6,399-409(1991)). 

10 Pharmaceutically Acceptable Carriers 

87. The compositions, can be used therapeutically in combination with a 
pharmaceutically acceptable carrier. 

88. Pharmaceutical carriers are known to those skilled in the art. These most 
typically would be standard carriers for administration of drugs to humans, including 

15 solutions such as sterile water, saline, and buffered solutions at physiological pH. The 
compositions can be administered intramuscularly or subcutaneously. Other 
compounds will be administered according to standard procedures used by those skilled 
in the art. 

89. Pharmaceutical compositions may include carriers, thickeners, diluents, 
20 buffers, preservatives, surface active agents and the like in addition to the molecule of 

choice. Pharmaceutical compositions may also include one or more active ingredients 
such as antimicrobial agents, anti-inflammatory agents, anesthetics, and the like. 

90. The pharmaceutical composition may be administered in a number of ways 
depending on whether local or systemic treatment is desired, and on the area to be treated. 

25 Administration may be topically (including ophthahnically, vaginally, rectally, 

intranasally), orally, by inhalation, or parenterally, for example by intravenous drip, 
subcutaneous, intraperitoneal or intramuscular injection. The disclosed compositions can 
be administered intravenously, intraperitoneally, intramuscularly, subcutaneously, 
intracavity, by gene gun, or transdermally. 

30 91. Preparations for parenteral administration include sterile aqueous or non- 

aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are 
propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable 
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organic esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous 
solutions, emulsions or suspensions, including saline and buffered me^^^ Parenteral ^ 
vehicles include sodium chloride solution. Ringer's dextrose, dextrose and sodium 
chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient 
5 replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), and the 
like. Preservatives and other additives may also be present such as, for example, 
antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

92. Formulations for topical administration may include ointments, lotions, 
creams, gels, drops, suppositories, sprays, liquids and powders. Conventional 

10 pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be 
necessary or desirable, 

93. Compositions for oral administration include powders or granules, suspensions 
or solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, 
flavorings, diluents, emulsifiers, dispersing aids or binders may be desirable. 

15 94. Some of the compositions may potentially be administered as a 

pharmaceutically acceptable acid- or base- addition salt, formed by reaction with 
inorganic acids such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric 
acid, thiocyanic acid, sulfuric acid, and phosphoric acid, and organic acids such as 
formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic 

20 acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or by reaction with an 
inorganic base such as sodium hydroxide, ammonium hydroxide, potassium hydroxide, 
and organic bases such as mono-, di-, trialkyl and aryl amines and substituted 
ethanolamines. 

Therapeutic Uses 

25 95. The substances of the present invention can be delivered at effective 

amounts or concentrations. An effective concentration or amount of a substance is one 
that results in treatment or prevention of the condition (e.g. HIV or AIDS). One skilled 
in the art would know how to determine an effective concentration or amoimt according 
to methods known in the art, as well as provided herein. One of skill in the art can 

30 utilize in vitro assays to optimize the in vivo dosage of a particular substance, including 
concentration and time course of administration. 
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96. Herein, immune modulatory substances are described. As defined immune 
modulation refers to any change in the immime response. This includes but is not 
limited to increase or decreases in the number of antigen specific plasma cells, memory 
B cells, memory T cells, activated CDS T cells, cytokine production, and cytolitic 

5 killing or the maintenance of homeostatic levels of the same when the responses would 
otherwise be changing. The dosage ranges for the administration of the substances are 
those large enough to produce the desired effect in which the symptoms of the disorder 
are affected. The dosage should not be so large as to cause adverse side effects, such as 
xmwanted cross-reactions, anaphylactic reactions, and the like. Generally, the dosage 

10 will vary with the age, condition, sex and extent of the disease in the patient and can be 
determined by one of skill in the art. The dosage can be adjusted by the individual 
physician in the event of any contraindications. Dosage can vary, and can be 
administered in one or more dose administrations daily, for one or several days. 

97. For example, to evaluate the efficacy of treatment of humans with a 

15 condition, such as for example, HTV, with a substance that modulates normal immune 
responses to HTV, the following studies can be performed. Patients with active 
infection can be selected. Drug efficacy can be monitored via viral titer, antibody 
ELISA or ELISPOT and CD4 T cell count. Patients can be randomized to two different 
protocols. In one protocol, subjects can remain on initial medication and in the second 

20 protocol, subjects can have their medication tapered after receiving the substance that 
modulates immime responses. 

98. In one embodiment, treatment can consist of either a single dosage of 0.3 
mg to 0.6 mg/animal of the vector expressing a substance that treats or prevents the 
condition. Additionally, dosage as low as 30[xg/animal to SOO^g/animal can be used 

25 with intramuscular injection of infecflious DNA. In one example, a DNA 

immunization comprising 0.3 mg of the pFOV-gag vector is administered to mouse via 
gene gun. After two weeks the subject is monitored using conventional assays to assess 
a generated immime response to pl7/24 in the form of antibody production and antigen 
specific T cells. Having generated a response, the mouse is then challenged with an 

30 infectious dose of HIV. The mouse is then monitored for disease progression and 
clinical symptoms associated with HTV. Alternatively, a mouse can be infected with 
HIV and then given a therapeutic dose of the pFOV-gag. The animal is monitored for 
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changes in the viral lode, clinical progression of symptoms, as well as immime 
responses in comparison to non treated control animals. In a further example, a DNA 
immunization comprising O.OSmg (SO^ig) of the pFOV-gag vector is administered to 
mouse via intrauscular injection of infectious DNA. After two weeks the subject is 

5 monitored using conventional assays to assess a generated immune response to pl7/24 
in the form of antibody production and antigen specific T cells. Having generated a 
response, the mouse is then challenged with an infectious dose of HIV. The mouse is 
then monitored for disease progression and clinical symptoms associated with HIV. 
Alternatively, a mouse can be infected with HIV and then given a therapeutic dose of 

10 the pFOV-gag. The animal is monitored for changes in the viral lode, clinical 

progression of symptoms, as well as immxme responses in comparison to non treated 
control animals. The art of determining dosage for an animal based on size is well 
known. It is imderstood that a skilled artisan would be able to determine the proper 
dosage of a substance for an animal based on the dosage of the same substance 

IS administered to another animal of similar or different size. For example, wheras a 200g 
mouse would receive a 50ng dose of the immunization, a 1501b (68kg) human would 
be administered an immunization comprising about 17mg of the vector. 

99. Disclosed are methods of detecting the expression of the vector of the 
invention comprising using a first antibody to the antigen to measure protein expression 

20 in a quantitative or qualitative way, further comprising detecting the first antibody 

directly via a colorimetric measurement produced through the use of a substrate and a 
conjugated antibody or indirectly via a firstantibody to the antigen which in tum is 
bound by a second antibody which is conjugated and will result in a colorimetric 
measurement when combined with a substrate. 

25 100. Also disclosed are methods of the invention, wherein the antigen is 

detected by placing an aliquot of the vector of the invention in a lane on a gel and 
probing the gel for the antigen. 

101. Disclosed are methods of detecting the expression of the vector of the 
invention comprising using a fluorescently labeled first antibody specific for the 

30 antigen and visualizing the antigen using a flow cytometer or fluorescence microscope. 

102. Also disclosed are methods of the invention, wherein the first antibody 
is not flourescently labeled, but a target for a second antibody with a fluorescent label. 
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103. Disclosed are methods of detecting the expression of the vector of claim 
the invention comprising using cytolitic killing assay to assess activity. 

104. Disclosed are methods of detecting the vector of the invention further 
comprising obtaining a sample from a subject comprising a tissue biopsy or removal of 

5 blood or bone marrow. 
Screening Methods 

105. Also provided by the present invention is a method of screening a 
substance for effectiveness in treating or reducing the severity of the condition (e.g., 
HIV infection) comprising: a) obtaining an animal having the condition or 

10 characteristic (e.g., symptom) of the condition; b) administering the substance to an 

animal having one or more characteristics of the condition; and assaying the animal for 
an effect on the condition, thereby identifying a substance effective in reducing the 
condition. 

106. The ability of a substance to reduce the severity of a condition can be 
IS determined by evaluating the histological and clinical manifestations, of tbe animal 

with condition before and after administration of the substance of interest and 
quantitating the amoimt of reduction of the condition. 

107. The animal in which the condition or characteristic (e.g., symptom) of 
the condition is produced can be any mammal and can include but is not limited to 

20 mouse, rat, guinea pig, hamster, rabbit, cat, dog, goat, monkey, and chimpanzee. The 
condition or characteristic (e.g., symptom) of the condition can be produced in the 
animal by any method known in the art. For example, HIV can be produced by 
introducing into the animal (eg., chimpanzee infected with HIV or rhesus macaques or 
nemestrina macaques infected with an HIV-1 env on an SIV backbone, PuUium , JK, et. 

25 al., J. Infectious Dis. 183: 1023, 2001) an infectious amount of HIV. 

108* The present invention also provides a method of screening for a 
substance effective in preventing the condition (e.g., HTV infection) comprising: a) 
adndnistering the substance to an animal susceptible to the condition; b) subjecting the 
animal to treatment that will induce the condition or characteristic (e.g., symptom) of 

30 the condition; and c) assaying cells from the animal for an change in immune responses 
as compared to an the immune responses in a control animal having the condition in the 
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absence of the substance identifies a substance that is effective in preventing the 
condition. 

109. For example, the methods of measuring the amoxmt of pl7/24 or other 
HIV antigen in an animal include, but are not limited to, ELIS A, PGR, FACS analysis, 

5 reverse-transcriptase-polymerase chain reaction and ELISPOT, Northern blots, 
Southem blots, and Western blots. 

110. A model for use in screening for substances effective in treating or 
preventing a disease comprising an animal capable of manifesting a characteristic of the 
disease is provided, wherein the animal has been administered the vector of the 

10 invention. 

111. A method of making the model of the invention comprising obtaining 
administering to an animal capable of manifesting a characteristic of the disease and 
administering to said animal the vector of claim la vector of the invention which 
encodes an antigen associate with the disease. 

15 1 1 2. A method of screening for a substance effective in treating a disease 

associated with an immunizing construct comprising: a) administering the substance to ^ 
the model of tiie invention; and b) assaying for an change in the course of the disease as 
compared to an the course of the disease in a control subject; an improvement in the 
covirse of the disease in the presence of the substance identifies a substance that is 

20 effective in treating the disease. 

113. A method of screening for a substance effective in preventing a disease 
associated with an immunizing construct comprising: a) administering a the vector of 
the invention to a subject; b) subjecting the subject to treatment that will induce the 
disease or characteristic (e.g., symptom) of the disease; and c) assaying for an change 

25 in the course of the disease as compared to an the course of the disease in a control 

subject; an improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in preventing the disease. 

114. A method of screening for a substance effective in treating a disease 
associated with an immunizing construct comprising: a) subjecting the subject to 

30 treatment that will induce the disease or characteristic (e.g., symptom) of the disease; b) 
administering a the vector of the invention to a subject; and c) assaying for an change in 
the course of the disease as compared to an the course of the disease in a control 
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subject; an improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in treating the disease. 
Sequence similarities 

1 15. It is iinderstood that as discussed herein the use of the terms homology 
5 and identity mean the same thing as similarity. Thus, for example, if the use of the 

word homology is used between two non-natural sequences it is understood that this is 
not necessarily indicating an evolutionary relationship between these two sequences, 
but rather is looking at the similarity or relatedness between their nucleic acid 
sequences. Many of the methods for determining homology between two 
10 evolutionarily related molecules are routinely applied to any two or more nucleic acids 
or proteins for the purpose of measuring sequence similarity regardless of whether they 
are evolutionarily related or not. 

116. In general, it is understood that one way to define any known variants 
and derivatives or those that might arise, of the disclosed genes and proteins herein, is 

15 through defining the variants and derivatives in terms of homology to specific known 
sequences. This identity of particular sequences disclosed herein is also discussed 
elsewhere herein. In general, variants of genes and proteins herein disclosed typically 
have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 
88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence 

20 or the native sequence. Those of skill in the art readily understand how to determine 
the homology of two proteins or nucleic acids, such as genes. For example, the 
homology can be calculated after aligning the two sequences so that the homology is at 
its highest level. 

117. Another way of calculating homology can be performed by published 
25 algorithms. Optimal alignment of sequences for comparison may be conducted by the 

local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 
(1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. 
Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms 
30 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection. 
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118. The same types of homology can be obtained for nucleic acids by for 
example the algorithn:is disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. 
Proc. Natl Acad. Set USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol 
183:281-306, 1989 which are herein incorporated by reference for at least material 

S related to nucleic acid alignment. It is understood that any of the methods typically can 
be used and that in certain instances the results of these various methods ntiay differ, but 
the skilled artisan xmderstands if identity is foxmd with at least one of these methods, 
the sequences would be said to have the stated identity, and be disclosed herein. 

119. For example, as used herein, a sequence recited as having a particular 
10 percent homology to another sequence refers to sequences that have the recited 

homology as calculated by any one or more of the calculation methods described 
above. For example, a first sequence has 80 percent homology, as defined herein, to a 
second sequence if the first sequence is calculated to have 80 percent homology to the 
second sequence using the Zuker calculation method even if the first sequence does not 

15 have 80 percent homology to the second sequence as calculated by any of the other 

calculation methods. As another example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 
percent homology to the second sequence using both the Zuker calculation method and 
the Pearson and Lipman calculation method even if the first sequence does not have 80 

20 percent homology to the second sequence as calculated by the Smith and Waterman 
calculation method, the Needleman and Wxmsch calculation method, the Jaeger 
calculation methods, or any of the other calculation methods. As yet another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the 
first sequence is calculated to have 80 percent homology to the second sequence using 

25 each of calculation methods (although, in practice, the different calculation methods 
will often result in different calculated homology percentages). 
Nucleic acids 

120. There are a variety of molecules disclosed herein that are nucleic acid 
based, including for example the nucleic acids that encode, for example HIV-1 gag^ as 
30 well as various functional nucleic acids. The disclosed nucleic acids are made up of for 
example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting 
examples of these and other molecules are discussed herein. It is understood that for 
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example, when a vector is expressed in a cell, that the expressed niElNA will typically 
be made up of A, C, G, and U. Likewise, it is understood that if, for example, an 
antisense molecule is introduced into a cell or cell environment flurough for example 
exogenous delivery, it is advantageous that tiie antisense molecule be made up of 
nucleotide analogs that reduce the degradation of the antisense molecule in the cellular 
environment. 

Sequences 

121. There are a variety of sequences related to the gag gene that are publicly 
available (eg., Genbank Accession Number: LOS 707), these sequences and others are 
herein incorporated by reference in their entireties as well as for individual 
subsequences contained therein. 

122. One particular sequence set forth in gag that is publicly available and 
having Genbank accession mmiber L03707 is used herein, as an example, to exemplify 
the disclosed compositions and methods. It is understood that the description related to 
this sequence is applicable to any sequence related to gag unless specifically indicated 
otherwise. Those of skill in the art understand how to resolve sequence discrepancies 
and differences and to adjust the compositions and metiiods relating to a particular 
sequence to other related sequences (i,e. sequences of gag). Primers and/or probes can 
be designed for any gag sequence given the information disclosed herein and known in 
the art. 

Peptides/ Protein variants 

123. There are niraierous variants of protein antigens that are antigenic. For 
example, there are variants of the gag protein and protein that are known and herein 
contemplated. In addition to the known functional HIV-1 gag strain variants there are 
derivatives of the gag proteins which also function in the disclosed methods and 
compositions. Protein variants and derivatives are well understood to those of skill in 
the art and in can involve amino acid sequence modifications. For example, amino acid 
sequence modifications typically fall into one or more of three classes: substitutional, 
insertional or deletional variants. Insertions include amino and/or carboxyl terminal 
fusions as well as intrasequence insertions of single or mtdtiple amino acid residues. 
Insertions ordinarily will be smaller insertions than those of andno or carboxyl terminal 
fusions, for example, on the order of one to four residues. Immunogenic fusion protein 
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derivatives, such as those described in the examples, are made by fusing a polypeptide 
sufficiently large to confer immunogenicity to the target sequence by cross-linking in 
vitro or by recombinant cell culture transformed with DNA encoding the fusion. 
Deletions are characterized by the removal of one or more amino acid residues from the 

5 protein sequence. Typically, no more than about from 2 to 6 residues are deleted at any 
one site within the protein molecule. These variants ordinarily are prepared by site 
specific mutagenesis of nucleotides in the DNA encoding the protein, thereby 
producing DNA encoding the variant, and thereafter expressing the DNA in 
recombinant cell culture. Techniques for making substitution mutations at 

10 predetermined sites in DNA having a known sequence are well known, for example 

Ml 3 primer mutagenesis and PGR mutagenesis, Amino acid substitutions are typically 
of single residues, but can occur at a number of different locations at once; insertions 
usually will be on the order of about from 1 to 10 amino acid residues; and deletions 
will range about from 1 to 30 residues. Deletions or insertions preferably are made in 

15 adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues. Substitutions, 
deletions, insertions or any combination thereof may be combined to arrive at a final 
construct. The mutations must not place the sequence out of reading frame and 
preferably will not create complementary regions that could produce secondary mRNA 
structure. Substitutional variants are those in which at least one residue has been 

20 removed and a different residue inserted in its place. Such substitutions generally are 
made in accordance with the following Tables 1 and 2 and are referred to as 
conservative substitutions. 
TABLE 1 : Amino Acid Abbreviations 



Amino Acid 


Abbreviations 


Alanine 


Ala, A 


AUosoleucine 


Alle 


Arginine 


Arg,R 


Asparagine 


Asn,N 


aspartic acid 


Asp, D 


Cysteine 


Cys,C 


glutamic acid 


Glu,E 


Glutamine 


Ghi,0 


Glycine 


Gly,G 


Histidine 


His,H 


Isolelucine 


He, I 


Leucine 


Leu, L 
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Proline 


Pro P 


pyroglutamic acidp 


Glu 


Serine 


Ser, S 


Threonine 


Thr.T 


Tyrosine 


Tyr.Y 


Tryptophan 


Trp, W 


Valine 


Val, V 



TABLE 2:Amino Acid Substitutions 

Original Residue Exemplary Conservative Substitutions, others are known 

in the art. 

Ala; Ser 

Arg; Lys, ghi 

Asn; Gin; his 

Asp; Glu 

Cys; Ser 

Gin; Asn, lys 

Glu; Asp 

Gly; Pro 

His; Asn; gin 

He; Leu; val 

Leu; He; val 

Lys; Arg; g in; 

Met; Leu; ile 

Phe; Met; leu; tyr 

Ser; Thr 

Thr; Ser 

Trp; Tyr 

Tyr; Trp; phe 

Val; Ile; leu 



124. Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those in Table 2, i.e., selecting 

5 residues that differ more significantly in their effect on maintaining (a) the structure of 
the polypeptide backbone in the area of the substitution, for example as a sheet or 
helical conformation, (b) the charge or hydrophobicity of the molecule at the target site 
or (c) the bulk of the side chain. The substitutions which in general are expected to 
produce the greatest changes in the protein properties will be those in which (a) a 

10 hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
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residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side 
chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative 
residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., 
S phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in 
tixis case, (e) by increasmg the number of sites for sulfation and/or glycosylation. 

125. The replacement of one amino acid residue with another that is 
biologically and/or chemically similar is known to those skilled in the art as a 
conservative substitution. For example, a conservative substitution would be replacing 
10 one hydrophobic residue for another, or one polar residue for another. The 

substitutions include combinations such as, for example, Gly, Ala; Val, He, Leu; Asp, 
Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively substituted 
variations of each expUcitly disclosed sequence are included within the antigenic 
polypeptides encoded and expressed by the vectors provided herein. 
1 5 126. Substitutional or deletional mutagenesis can be employed to insert sites 

for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of 
cysteine or other labile residues also may be desirable. Deletions or substitutions of 
potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the 
basic residues or substituting one by glutaminyl or histidyl residues, 
20 127. Certain post-translational derivatizations are the result of the action of 

recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl 
residues are frequently post-translationally deamidated to the corresponding glutamyl 
and aspaiyl residues. Alternatively, these residues are deamidated under mildly acidic 
conditions. Other post-translational modifications include hydroxylation of proline and 
25 lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of 
the o-amino groups of lysine, arginine, and histidine side chains (T.E. Creighton, 
Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 
79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of 
the C-terminal carboxyl. 
30 128. It is understood that one way to define the variants and derivatives of the 

disclosed proteins herein is through defining the variants and derivatives in terms of 
homology/identity to specific known sequences. For example, SEQ ID NO:3 sets forth 
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a particular sequence of a gag protein. Specifically disclosed are variants of these and 
other proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% 
or 95% homology to the stated sequence. It is understood that tiiose of skill in the art 
will recognize variants of a disclosed protein as being a variant. Particularly, tiiere are 

5 nximerous naturally occurring variants of viral (e.g., HIV) antigens. For exanoiple, it is 
understood that those of skill in the art would recognize that the particular gag protein 
disclosed in SEQ ID NO: 4 is a gag protein from a different HTV-l isolate from the 
HIV-1 gag protein in SEQ ID NO: 3 and therefore a variant that can be used with 
equivalent results. Those of skill in the art readily understand how to determine the 

10 homology of two proteins. For example, the homology can be calculated after aligning 
the two sequences so that the homology is at its highest level. 

129. Another way of calculating homology can be performed by published 
algorithms. Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman Adv. AppL Math. 2: 482 (1981), by 

15 the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 
(1970), by the search for similarity method of Pearson and Lipman, Proc. Natl, Acad. 
Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection. 

20 130. The same types of homology can be obtained for nucleic acids by for 

example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. 
Proc. Natl, Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 
183:281-306, 1989 which are herein incorporated by reference for at least material 
related to nucleic acid alignment. 

25 131. It is understood that the description of conservative mutations and 

homology can be combined together in any combination, such as embodiments that 
have at least 70% homology to a particular sequence wherein the variants are 
conservative mutations. 

132. As this specification discusses various proteins and protein sequences it 

30 is understood that the nucleic acids that can encode those protein sequences are also 
disclosed. This would include all degenerate sequences related to a specific protein 
sequence, i.e. all nucleic acids having a sequence that encodes one particular protein 
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sequence as well as all nucleic acids, including degenerate nucleic acids, encoding the 
disclosed variants and derivatives of the protein sequences. Thus, while each particular 
nucleic acid sequence may not be written out herein, it is understood that each and 
every sequence is in fact disclosed and described herein through the disclosed protein 

5 sequence. For example, one of the many nucleic acid sequences that can encode the 
protein sequence set forth in SEQ ID NO: 3 is set forth in SEQ ID NO: 2. In addition, 
for example, a disclosed conservative derivative of SEQ ID NO: 4 is shown in SEQ ID 
NO: 5, where the isoleucine (I) at position 19 is changed to a valine (V). It is 
imderstood that for this mutation all of the nucleic acid sequences that encode this 

10 particular derivative of the gag gene of the LAI strain of HIV- 1 are also disclosed. It is 
also understood that while no amino acid sequence indicates what particular DNA 
sequence encodes that protein within an organism, where particular variants of a 
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes 
that protein in the particular gag from which that protein arises is also known and * 

IS herein disclosed and described. 
Computer readable mediums 

133. It is understood that the disclosed nucleic acids and proteins can be 
represented as a sequence consisting of the nucleotides of amino acids. There are a 
variety of ways to display these sequences, for example the nucleotide guanosine can 

20 be represented by G or g. Likewise the amino acid valine can be represented by Val or 
V. Those of skill in the art understand how to display and express any nucleic acid or 
protein sequence in any of the variety of ways that exist, each of which is considered 
herein disclosed. Specifically contemplated herein is the display of these sequences on 
computer readable mediums, such as, commercially available floppy disks, tapes, chips, 

25 hard drives, compact disks, and video disks, or other computer readable mediimis. 

Also disclosed are the binary code representations of the disclosed sequences. Those of 
skill in the art understand what computer readable medixuns. Thus, computer readable 
mediums on which the nucleic acids or protein sequences are recorded, stored, or 
saved. 
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Methods of making the compositions 

134. The compositions disclosed herein and the compositions necessary to 
perform the disclosed methods can be made using any method known to those of skill 
in the art for that particular reagent or compound unless otherwise specifically noted. 

S Nucleic acid synthesis 

135. For example, the nucleic acids, such as, tiie oligonucleotides to be used 
as primers can be made using standard chemical synthesis methods or can be produced 
using enzymatic methods or any other known method. Such methods can range from 
standard enzymatic digestion followed by nucleotide fragment isolation (see for 

10 example, Sambrook et aL, Molecular Cloning: A Laboratory Manual, 2nd Edition 

(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N,Y., 1989) Chapters 5, 6) 
to purely synthetic methods, for example, by the cyanoethyl phosphoramidite method 
using a Milligen or Beckman System IPlus DNA synthesizer (for example. Model 
8700 automated synthesizer of Milligen-Biosearch, Burlington, MA or ABI Model 

15 380B). Synthetic methods useful for making oligonucleotides are also described by 
Ikuta et aL, Ann. Rev. Biochem. 53:323-356 (1984), (phosphotriester and phosphite- 
triester methods), and Narang et aL, Methods EnzymoL, 65:610-620 (1980), 
(phosphotriester method). Protein nucleic acid molecules can be made using known 
methods such as those described by Nielsen et al., Bioconjug. Chem. 5:3-7 (1994). 

20 136. Disclosed are cells produced by the process of transforming the cell with 

any of the disclosed nucleic acids. Disclosed are cells produced by the process of 
transforming the cell with any of the non-natxirally occurring disclosed nucleic acids. 

137. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed nucleic acids. Disclosed are any of the non-naturally 

25 occurring disclosed peptides produced by the process of expressing any of the disclosed 
nucleic acids. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed non-natural nucleic acids. 

138. Disclosed are animals produced by the process of transfecting a cell 
within the animal with any of the nucleic acid molecules disclosed herein. Disclosed 

30 are animals produced by the process of transfecting a cell within the animal with any of 
the nucleic acid molecules disclosed herein, wherein the animal is a mammal. Also 
disclosed are animals produced by the process of transfecting a cell within the animal 
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with any of the nucleic acid molecules disclosed herein, wherein the mammal is mouse, 
rat, rabbit, cow, sheep, pig, or primate. 

139. Also disclose are animals produced by the process of adding to the 
animal any of the cells disclosed herein. 

S Methods of using the compositions as research tools 

140. The disclosed vectors, compositions and methods can be used for 
targeted gene disruption and modification in any animal that can undergo these events. 
Gene modification and gene disruption refer to the methods, techniques, and 
compositions involved in the selective removal or alteration of a gene or stretch of 

10 chromosome in an animal, such as a mammal, in a way that propagates the 

modification through the germ line of the mammaL In general, a cell is transformed 
with a vector which is designed to homologously recombine with a region of a 
particular chromosome contained within the cell, for example, as described herein. 
This homologous recombination event can produce a chromosome which has 

15 exogenous DNA introduced, for example in frame, with the surrounding DNA. This 
type of protocol allows for very specific mutations, such as point mutations, to be 
introduced into the genome contained within the cell. Methods for performing this type 
of homologous recombination are disclosed herein. 

141 . One of the preferred characteristics of performing homologous 
20 recombination in manmialian cells is that the cells should be able to be cultxared, 

because the desired recombination events cccixr at a low frequency. 

142. Once the cell is produced through the methods described herein, an 
animal can be produced from this cell through either stem cell technology or cloning 
technology. For example, if the cell into which the nucleic acid was transfected was a 

25 stem cell for the organism, then this cell, after transfection and culturing, can be used to 
produce an organism which will contain the gene modification or disruption in germ 
line cells, which can in tum be used to produce another animal tiiat possesses the gene 
modification or disraption in all of its cells. In other methods for production of an 
animal containing the gene modification or dismption in all of its cells, cloning 

30 technologies can be used. These technologies generally take the nucleus of the 

transfected cell and either through fiision or replacement fiise tiie transfected nucleus 
with an oocyte which can then be manipulated to produce an animal. The advantage of 
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procedures that use cloning instead of ES technology is that cells other than ES cells 
can be transfected. For example, a fibroblast cell, which is very easy to culture can be 
used as tiie cell which is transfected and has a gene modification or disruption event 
take place, and then cells derived fi-om this cell can be used to clone a whole animal. 

143. Throughout this application, various publications are referenced. The 
disclosures of these publications in their entireties are hereby incorporated by reference 
into this application in order to more fully describe the state of the art to which this 
invention pertains. The references disclosed are also individually and specifically 
incorporated by reference herein for the material contained in them that is discussed in 
the sentence or section in which the reference is cited. 

144. It will be apparent to those skilled in the art that various modifications 
and variations can be made in the present invention without departing firom the scope or 
spirit of the invention. Other embodiments of the invention will be apparent to those 
skilled in tiiie art firom consideration of the specification and practice of the invention 
disclosed herein. It is intended that the specification and examples be considered as 
exemplary only, with a true scope and spirit of the invention being mdicated by the 
following claims. 

Examples 

145. The following examples are put forth so as to provide those of ordinary 
skill in the art with a complete disclosure and description of how the vectors, 
compounds, compositions, and/or methods claimed herein are made and evaluated, and 
are intended to be purely exemplary of the invention and are not intended to limit the 
scope of what the inventors regard as their invention. Efforts have been made to ensure 
accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors 
and deviations should be accoimted for. Unless indicated otherwise, parts are parts by 
weight, temperature is in ^'C or is at ambient temperature, and pressure is at or near 
atmospheric. 

Example 1 

146. In order to obtain the pl7/p24 fragment of the HTV-l gag gene, pSX 
plasmid containing the gag gene was subjected to 30 cycles of PGR using forward 
Apal-pl7 5' primer and reverse Notl-p24 3' primer. PGR product was cloned into a 
TA cloning vector (pGR2.1 Invitrogen; Garlsbad, GA) and expanded. Fig. 1 shows a 
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map of the empty pHSV vector. The vector contains the viral envelope gene env as 
well as its structural gene gag and viral polymerase gene pol. The vector also possesses 
a transactivator (TAS) gcac hel-1 and 2 LTR that flank the coding region of the vector. 
An inspection of the vector SEQ (SEQ ID NO: 1) shows where various restriction sites 
5 are located on the vector. A complete listing of the sites is found in Fig. 2 and these 
sites are shown in SEQ ID NO: 1. 

147. Inspection of the restriction map revealed Apal and Notl unique 
restriction sites located around the BET (Bel-2) gene at 13522 and 12816 respectively. 
As snch, the BET gene was chosen as the site for integration of the exogenous nucleic 

10 acid (Figure 3). The pl7/p24 were excised from the TA cloning vector using Apal and 
Notl restriction enzymes. Similarly, the empty pHSV vector was cut with Apal and 
NotL 

148. The pl7/p24 segment of gag (789 (start of pl7) - 1876 (end of p24)) 
was ligated to pHSV and the pHSV-GAG was transformed into E. coli via 

1 5 electroporation. The bacteria was plated onto high Amp plates with the pHS V 

conferring ampR between the LTRs on the approximate 3kb plamid. Seven colonies 
were chosen and checked for insertion and proper orientation. Figure 4 shows that 
clone 2, 3, 4 when cut with^pa/ zndNotl possessed a 1 kb fragment. This indicates 
that the pl7/p24 fragment is intact and properly oriented. 

20 149. Knowing that the construct was integrated into the vector, the next 

objective was to examine expression levels of the antigen. Integration is meaningless if 
the desired protein is not expressed. To rest expression of pl7/p24, the loaded pHSV 
vector was transfected using lipofectamine mto BHK cells and supematants were 
collected at various time points post transfection. ELIS A testing for p24 expression 

2S was conducted on the collected supematants (Fig. 5). 

150, Following transfection, clone 2 had clearly measurable p24 expression. 
The p24 expression level for clone 2 was increased at each time point after day 2, 
clones 3 and 4 showed no measurable expression of p24 xmtil day 5 post transfection. 
Subsequent Elisa data revealed that 5 days post transfection clones 2, 3, and 4 could 

30 produce as much as 500 ng/ml, lOOOng/ml, and 800ng/ml of p24 respectively (Fig. 6). 
At the same time in vitro cytopathology was assessed. Cells from the day 2 cultures 
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had very little CPE, However, by day 5 CPE was readily observed. The presence of 
CPE shows that the p24 activity was associated with the replicating viral vector. 

151. Fig. 7 shows p24 expression in clone 2 after 2"** pass. Supematants were 
collected at 3, 7 and 10 days after infection with diluted virus. At day 3 no p24 is 

5 observed. However, by day 7 lOOOng/ml of p24 is observed at the 10"^ dilution, which 
is confirmed at the 10"^ dilution with a measurement of lOOng/ml. By day 10, the 10"^ 
to 10"^ dilutions all had approximately 200ng/ml of p24. As such, day 7 undiluted virus 
would yield 10^ ng/ml p24 and at day 10 the secretion level ranges fi*om 2x10"^ to 2x10^ 
ng/ml of p24. This is an extremely high expression level and indicates likely success in 

10 vivo as this vector is resulting in expression comparable to the highest 25% of vectors. 

152. As a verification that p24 Elisa data was actually measuring pl7/p24 
(p41) and not just p24, western blotting was performed (Fig. 8). Supematants from 
BHK cells infected with clone 2 pHSV-gag, empty pHSV, or mock infected (infected 
with PBS) were run on a polyacrylamide gel and probed for p24, HIV, or LTFOOl as a 

15 positive control. LTFOOl antibody bound to all 3 lanes. However, bands were 

apparent only in the lane for clone 2 when either ariti-p24 or anti-HIV was used as a 
probe. The resulting bands show the smallest band to be 41kd, which corresponds to 
the size of pl7/p24. 

153. Histological observations were made from in vitro culture of pHSV in 
20 BHK cells. Cells infected with pHSV-gag showed CPE; however, this CPE is not 

usually observed in vivo. 
Methods 

Cloning and PCR 

154. The pi 7 and p24 segments of the LAI strain of HIV- 1 gag were cloned 
25 into a pHSV spumavirus vector. Briefly, pSX plasmid containing the LAI strain HIC-1 

gag gene was obtained from the HIV repository. The pSX plasmid was used as a 
template for PCR amplification and cloning of pl7 and p24. A forward Apal-pl7 5' 
primer TCC GGG CCC GGA ATG CCT ATA GTC CAG AAC ATC C (SEQ ID NO: 
6) and a reverse Notl-p24 3' primer GCG GCC GCG TTT TGA GAA CGA AAT 
30 ACC GG (SEQ ID NO: 7) were used to amplify p4L The amplified segment was 
cloned into a TA cloning vector (pCR2.1; Invitrogen Carlsbad, CA) and expanded. 
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Restriction digest of the TA-p41 vector with Apal and Notl removed pl7/24 jSiom the 
vector. The p41 segment was then inserted into pHSV within the bet (bel'2) gene. 
Transformation 

155. The pHSV vector and pl7/24 antigen were ligated together with ligase 
5 for 16hrs at 4C. Following ligation, pHSV-gag was electroporated into E. coli and 

incubated for Ihr at 37C in 2xyT broth media. The samples are then plated on high 
Ampicillin plates and incubated overnight at 37C. Colonies growing on the high Amp 
plates were selected and transferred to 2xyT broth media overnight at 37C. Minipreps 
were performed on the ovemight cultures. Plasmids were cut with Apal and Notl and 
10 aliquots were run on 1% agarose gel. 

ELISA and Western Blot 

156. ELISA and Westem blot were performed using techniques commonly 
used in the art. 

Example 2 

15 157. Animal subjects can be used to screen the effectiveness of a pHSV 

vector and an antigen-encoding nucleic acid. Additionally, animal subjects may be 
used to study the vector-antigen combination's ability to prevent or treat a condition. 
The vector can also be used to induce a condition in an animal that is associate with a 
disease, and that animal can then be used to study the disease/condition or to study 

20 potential treatments for the disease/condition. Such conditions can be infections 
resulting from vimses, bacteria, or parasites; autoinunune reactions including 
inflammatory diseases, asthma, systemic lupus erythamatosis, muscular dystrophy, or 
multiple sclerosis, diabetes, tay-sachs, spinobifida, cerebral palsy, parkinson's disease, 
lou gehrigg disease, alzheimer's, hemophelia, Addsion's disease, Gushing' s disease; or 

25 cancer. Animals can include but are not limited to mice, rats, pig, dog, monkey, 
chimpanzee, and himian. 

158. Mice are injected with a sub-inmiunizing dose of pHSV-gag. O.Smg/ 
animal of pHSV-gag are administered via gene gun. This same immimizing dose may 
be administered by an alternative route such as intramuscular, intravenous, 

30 intraperitoneal, intracranial, or subcutaneous. Mice are then monitored and starting at 
15 days post immunization mice are bled every three days to day 30 post inmiunization. 
Blood samples are collected for serum extraction and for the isolation of peripheral 
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blood Ijncnphocytes (PBL). Serum samples can be used immediately for ELISA assays 
to look for antibody to the antigen encoded in the vector or for cytokine increases 
following immimization. Samples can also be frozen and stored at -80C for testing at a 
later time. 

5 159. Lypmhocytes are removed from whole blood and diluted in media (e.g. 

RPMI 1640, DMEM, MEM, or EMEM) with 10% serum. PBL are then stained for 
antigen specific T cells using antigen specific MHC class I and MHC class 11 tetramers 
to be visualized via flow cjrtometry. Simultaneously or separately PBL are stained with 
surface markers to look at activation characteristics of the cells and other surface 

10 phenotypes. Some examples of the antibodies used are CDS, CD4, CDS, CDl la (LFA- 
1), CD43 (IBl 1), CD44, CD45RA, CD45RO, CD62L (L-selectin), CD69 and Bcl-2. 
Additionally, PBL can be stained for the production of intracellular cytokines such as 
IFN-y, IL-4, TNF-a, IL-2, and IL-10 following stunulation with antigen. Lymphocytes 
can also be used in an ELISPOT assay for cytokine production following stimxilation 

1 5 with specific antigen or CTL assay to assess killing activity. 

. 160. All of the stains and assays allow for the assessment of cell-mediated . 
responses to the priming antigen. Tetramer staining allows for the enumeration of 
antigen specific T cells of a known specificity. This coupled with surface markers 
details the activation state of the antigen specific cells. Tetramer positive T cells 

20 (CD8+/CD3+, CD3+, or CD3+/CD4+) possess various surface mai-kers that reveal the 
activation state of the cells. By multiplying percentages obtained from the analysis of 
the staining by the nimiber of cells, the exact number of cells exhibiting any 
characteristic can be obtained. Exposure to an antigen creates cells that express high 
levels of CDl la, and CD44. Once these levels are increased on a cell, they will not 

25 decrease, and in conjunction with tetramer stains and other surface markers can lead to 
the identification of memory T cells. CD43 and CD69 both increase during activation, 
but decrease following clearance of the priming antigen. L-selecting will decrease on 
the surface of activated cells and increase over time to naive levels. Bcl-2 also provides 
a measure of the activation level of cells as activated cells will have decreased Bcl-2 

30 levels relative to naive T cells indicating a susceptibility to apoptosis whereas memory 
cells will have increased levels of Bcl-2. 
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161. ELISPOT and intracellular cytokine stains allow the enumeration of 
cytokine secreting antigen-specific cells following stimulation. This can be paired with 
the number of tetramer positive cells to determine what portion, if not all, of the T cells 
are functional. 

5 162. CTL assays determine the ability for antigen specific T cells to kill cells 

e3q)ressing the antigen. The assay reads on the cytolitic activity of T cells as 
determined by release of a chemilimiinescent marker or radioactive material in labeled 
target cells. 

163. Like the cell-mediated responses, himioral responses may be measured 
10 in harvested PBL. PBL maybe stained for anti-mouse IgG, CD45R (B220), PNA, and 

CD138 (syndecan-1) to look at the activation of B cells or the mmiber of plasma cells. 
Plasma cells, the effector arm of the humoral immune system are syndecan-1 +, and 
B220-. Memory B cells will be IgG+, B220+, and PNA+, the presence of PNA 
positive cells indicates transition through a germinal center reaction. Additionally^ 
15 PBL may be used to enumerate the number of antigen specific plasma cells on a plasma 
cell elispot or the number of antigen specific B cells on a memory B cell assay. 

1 64. ELIS A data can be combined with the humoral immimity data from the 
PBL to assess the effect of the immunization on the mouse. 

1 65. Mice that have measurable responses can be boosted and responses 

20 monitored. However, direct challenge is necessary as it indicates liie ability to moxmt a 
protective immime response. Mice that do not have an immune response or mice that 
have an immune response that but need to be boosted can receive second and third 
immunizations by the same or different route than the priming immunization over the 
next few months. 

25 166. As mice cannot be infected with a lentivims, monkeys (e.g., rhesus 

macaques) are used for challenge experiments. Monkeys can receive a similar 
constmct (pHSV-SIV-gag) which produces similar levels of gag in vitro. Monkeys are 
inoculated with the pHSV-SIV-gag DNA, and after several weeks, they can be 
challenged with an infectious SIV. The animals can then be observed for protection 

30 from infection or disease. 

167. For a challenge, monkeys are given an infectious dose of an antigen (e.g. 
SIV) or stimulated in such a way as to induce a condition to be prevented. Challenged 
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monkeys can be assessed in the same way as immunized mice with additional assays 
used to look at antigen load following challenge and gross pathological and histological 
assessments made from tissue biopsy or necropsy, 

168. Monkeys can also be given an infectious dose of an antigen (e.g. SIV) or 
5 stimulated in such a way as to create a condition to be treated. Once the condition is 

established, monkeys are given a therapeutic dose of the immimizing antigen and 
monitored as to the effects on the condition. A successful therapy will not necessarily 
clear the condition or infection, but may slow or stop progression. The same assays to 
those used for prophylactic immunization are used to characterize the responses 
10 Methods 

Flow cytometry 

169. Cells are suspended at a concentration that is appropriate according to 
the antibody manufacturers instructions in FACS buffer (2% FCS (or BSA) in PBS 
(0.2% NaNs can also be added)). Antibodies (CDS, CD4, CD8, CDl la (LFA-10; CD43 

15 (IBll), CD44, CD45R (B220), CD45RA, CD45RO, CD62L (L-selectin), CD69, 

CD 138, PNA, anti-mouse IgG) are added to the cells according to the manufacturers 
instructions and incubated for 30min at 4C in the dark. Cells are washed 3X in FACS 
Buffer. 1 wash comprises centrifiiging the cells at SOOrpm for 3min to pellet the cells, 
removing the media, and resuspending the cells. After the third wash, cells are 

20 resuspended in 2% PFA in PBS. Staining is analyzed on a FACSCalibur flow 
cytometry instrument (Beckton-Dickenson) or other suitable cytometer. 
Intracellular cytokine staining. 

170. Cells are stimulated for 5hrs in the presence of antigen and BFA 
(meninsen may be substituted for BFA) at 37C (this stimulation is not needed for Bcl-2 

25 staining). After the 5hr incubation, cells are centrifiiged and the media removed and 
resuspended in FACS buffer (2% FCS (or BSA) in PBS (0.2% NaNs can also be 
added)). Antibodies (CD3, CD4, CDS, CDl la (LFA-1), CD43 (IBl 1), CD44, CD45R 
(B220), CD45RA, CD45RO, CD62L (L-selectin), CD69, CD138, PNA, anti-mouse 
IgG) are added to the cells according to the manufacturers instructions and incubated 

30 for 30min at 4C in the dark. Cells are washed 3X in FACS Buffer. 1 wash comprises 
centrifiiging the cells at 800rpm for 3min to pellet the cells, removing the media, and 
resuspending the cells. After the third wash, cells are resuspended in cytofix/cytoperm 
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solution for 20inin at 4C. Cells are then washed 3X in a pranwash solution and 
resuspended at a concentration that is appropriate according to the antibody 
manufacturers instructions in permwash. Antibodies (IFN-y, TNF-a, IL-2, IL-4, IL-10, 
and Bcl-2) are added to the cells according to the manufacturers instructions and 

5 incubated for 30min at 4C in the dark. Cells are washed 3X in PermWash and 2X in 
FACS Buffer. After liie second FACS Buffer wash, cells are resuspended in 2% PFA 
in PBS. Staining is analyzed on a FACSCalibur flow cytometry instrument (Beckton- 
Dickenson) or other suitable cytometer. Intracellular kits are commercial available 
through (Becktbn-Dickenson). 

10 ELISA 

171 . 96-well ELISA plates are coated with antigen overnight at 4C. Plates 
are blocked with a suitable blocking media containing 10% FCS for l-2hrs at RT. 
Plates are resuspended in a known volume of an ELISA diluent (e.g., PBS+10% FCS) 
and serum is added to the plate and incubated at RT for 1 .5hrs. Platea are washed with 

15 PBS+0.1%Tween 3X and lOOjiil of antibodies (e.g., HRPO conjugated anti-mouse IgG) v ' 
diluted 1 : 1000 in ELISA diluent is added to each well. Plates are incubated for 1 .5hrs 
at RT and then washed 3x with PBS+0.1%Tween, Plates are then coated with 100|al of 
a chromagen substrate (e.g., o-phenyldiamine + 3% H2O2 in citrate buffer (Sodixmi 
citrate in H2O pH=5) and incubated for Ihr in the dark. The reaction is stopped by 

20 adding lOOjil of IN HCl. Plates are read in a ELISA plate reader. 
Plasma cellELISPOT 

172. 96-well filter (ELISPOT) plates are coated with antigen overnight. 
Plates are washed IX in PBS-0.1%Tween and 3X in PBS. Plate are blocked l-2hrs 
with media + 10% FCS. After blockmg, media is removed and lOO^il of media are 

25 added to each well. Effector cells (cells containing lymphocytes ft-om an immimized 
animal (e.g., PBL, splenocytes, hepatocytes, and bone marrow) are added to the plate 
and serial dilutions are made. Plates are incubated for 5hrs at 37C. After the 5hr 
incubation, plates are washed 3X in PBS and 3X in PBS+0.1% Tween. Plates are then 
coated with 100|al PBS+0.1% Tween+1% FCS and biotinylated Ab at 1/100 

30 concentration ovemight at 4C. After incubation plates are washed 4X in 

PBS+0. l%Tween and incubated for Ihr at RT in PBS+0. l%TweenH-l%FCS containing 
HRP-avidmD at 1/1000. Plates are washed 3X with PBS and a suitable chromagen 
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substrate is added (e.g,, AEC). Plates are allowed to develop for 8-15min and the 
reaction is stopped with washing the plates with water. Spots are visualized and 
counted using a dissection microscope or ELISPOT plate reader. 
Cytokine ELISPOT 

5 173. The day prior to the assay coat each well with 50 jal 10 [ig/ml purified 

anti-citokine antibody (e.g., IFN-y, TNF-a, IL-4, IL-10). Incubate overnight at 4 
° C. Wash plates 4 times with 200 |j.l PBS. Block wells with 200 |il 10%FCS complete 
medium, incubate at room temperature for at least 1 hour. Add effector cells to the top 
wells and make serial dilutions. Discard 100 ^tl of cells fi^om the last dilution. Add the 

10 stimulating antigen and IL-2 to the wells. Incubate plates at 37 ° C, 5% CO2 for 16-40 
hours (use level to ensure that plates stay even horizontally). Discard cells and wash 5 
times with PBS-Tween). Add 100 |il 2 jig/ml biotinylated-anti-cytokine antibody (e.g., 
IFN-7, TNF-a, IL-2, IL-4, IL-10) diluted in PBS-Tween. Incubate at room temperature 
for 2 hours or overnight at 4C. Wash plate 5 times with PBS-Tween, 200^il each wash. 

15 Add 100 \xl 1 :1000 Strepavidin -HRP, incubate 1 hoxir at room temperature, (avoid 
high background by not exceeding 1 hour incubation). Wash wells 5 times with PBS- 
Tween. Add 100 |Jil chromagen substrate per well, incubate at room temperature for 
10-20 minutes in dark (at 10 minutes, observe the color developed, if the color is not 
fiilly intense, incubate few more minutes and double check). Wash plates imder 

20 running tap water and air-dry plates (in hood with light tumed-off). Count and record 
numbers of spots/well by ELISPOT plate reader or stereomicroscope (20x). 
CTL 

174. Direct ex-vivo cytolytic killing is measured in a 5hr ^'Cr release assay. 
Briefly, target cells are labeled with 51Cr for Ihr at 37C. Following the Ihr incubation, 

25 cells are centrifuged for 8min at 1200rpm to pellet the cells. The media is poured off 
and the pellet resuspended in fresh media. Cells are centrifuged and resuspended two 
additional times. After the third spin, cells are resuspended in an appropriate amount of 
media to attain the desired effector to target ration (e.g., a 50:1 effector: target ratio 
with effector cells at a concentration of 1x10^/ ml would require target cells to be at a 

30 concentration of 2xl0Vml), and contacted with antigen. Effector cells (cells containing 
lymphocytes from an immimized animal (e.g., PBL, splenocytes, hepatocytes, and bone 
marrow) are contacted with the target cells for 5-6hrs at 37C. Cells are pelletted by 
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centrifixgation and supematants harvested. ^^Cr release is measured on an appropriate 
instrument. Percent killing is (experimental release-spontaneous release / maximum 
release — spontaneous release) x 100. 

175. Secondary killing to measure memory cell responses to antigen can also 
5 be measured. In this method, effector cells are incubated for 6 days in the presence of 

antigen stimulation and then used in a cytolytic assay as described above. 
Example 3 

176. In accordance with the methods described above the pHSV vector comprising an 
SIV-Gag-pl7/p27 construct was made. This vector is useful to establish a model for the study of 

10 the effectiveness of a particular treatment or prophylactic vaccine. For example, such a model can 
comprise a system to establish a foamy live viral vector for vaccine use. The pHS V with an SIV 
gag (pl7,p27) insert has been tested in vitro and shown to express similar levels of SIV gag as the 
HIV gag engineered vector. This SIVgag live viral vector can be used to inoculate rhesus macaques 
in order to determine protective immune responses that can develop following in vivo e?q)ression of 

15 the vector gene products including the SIV gag protein. Inoculated animals can then be challenged 
with wild type SIV to further determine any potential vaccine induced efiBcacy by studying primary . 
(sterilizing immunity) and secondary (time to morbidity) end points. 
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CLAIMS 

What is claimed: 

1. A live replicating human spumavirus vector (pHSV) suitable for human xise 
comprising an immunizing constmct, wherein the immunizing construct partially 
replaces the bet gene. 

2. The vector of claim 1, wherein the vector has the sequence of SEQ ID NO: 1 . 

3. The vector of claim 1, further comprising a reporter gene. 

4. The vector of claim 3, wherein the reporting gene is GFP, 

5. The vector of claim 1, wherein the immimizing construct is an antigen encoding 
nucleic acid. 

6. The vector of claim 5, wherein the antigen-encoding nucleic acid encodes a viral 
antigen. 

7. The vector of claim 6, wherein the viral antigen is an antigen from a virus selected 
from the group consisting of Herpes simplex virus type- 1, Herpes simplex virus type-2, 
Cytomegalovirus, Epstein-Barr vims. Varicella-zoster virus, Human herpesvirus 6, 
Human herpesviras 7, Human herpesvirus 8, Variola virus, Vesicular stomatitis virus. 
Hepatitis A virus. Hepatitis B virus. Hepatitis C virus. Hepatitis D virus. Hepatitis E 
virus, Rhinovirus, Coronavirus, Influenza virus A, Influenza virus B, Measles vims, 
Polyomavuiis, Human Papilomavims, Respiratory syncytial vims. Adenovirus, 
Coxsackie vims. Dengue virus, Mxmips vims, Poliovirus, Rabies vims, Rous sarcoma 
virus. Yellow fever virus, Ebola virus, Marburg vims, Lassa fever virus. Eastern 
Equine Encephalitis vims, Japanese Encephalitis vims, St. Louis Encephalitis vims, 
Murray Valley fever vims. West Nile virus, Rift Valley fever vims, Rotavims A, 
Rotavims B, Rotavims C, Sindbis vims, Simian Immunodeficiency cims. Human T- 
cell Leukemia vims type-1, Hantavims, Rubella vims. Simian Immunodeficiency vims, 
Himian Immimodeficiency vims type-1, and Human Immunodeficiency vims type-2. 

8. The vector of claim 7, wherein the viral antigen is HIV-GAG pl7/p24. 

9. The vector of claim 7, wherein the immimizing construct is SIV-GAG. 

10. The vector of claim 5, wherein the antigen-encoding nucleic acid encodes a 
bacterial antigen. 

1 1 . The vector of claim 10, wherein the bacterial antigen is an antigen from a 
bacterium selected from the group consisting of M tuberculosis, M. bovis, M. bovis 
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Strain BCG, BCG substrains, M avium, M. intracellulare, M. afiicanum, M kansasii, 
M. marinum, M. ulcerans,.M. avium subspecies paratuberculosis, Nocardia asteroides, 
o\hs>r Nocardia spocics, Legionella pneumophila, oihoT Legionella species. Salmonella 
typhi, other Salmonella species. Shigella species. Yersinia pestis, Pasteurella 
haemolytica, Pasteurella multocida, oHier Pasteurella species, Actinobacillus 
pleuropneumoniae. Listeria monocytogenes. Listeria ivanovii, Brucella abortus, otiier 
Brucella species, Cowdria ruminantium. Chlamydia pneumoniae. Chlamydia 
trachomatis. Chlamydia psittaci, Coxiella burnetti, other Rickettsial species. Ehrlichia 
species, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus pyogenes. 
Streptococcus agalactiae, Bacillus anthracis, Escherichia colt. Vibrio cholerae, 
Campylobacter species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas 
aeruginosa, other Pseudomonas species, Haemophilus influenzae, Haemophilus 
ducreyi, oihev Hemophilus species, Clostridium tetani, other Clostridium 
species. Yersinia enterolitica, and other Yersinia species. 

12. The vector of claim 5, wherein the peptide encoding nucleic acid encodes a 
parasitic antigen. 

13. The vector of claim 12, wherein the parasitic antigen is an antigen from a parasite 
selected from the group consisting of Toxoplasma gondii, Plasmodium falcipanim, 
Plasmodium vivax, Plasmodium malariae, oihev Plasmodium species., Trypanosoma 
brucei, Trypanosoma cruzi, Leishmania major, other Leishmania species., Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 

14. The vector of claim 5, wherein the peptide encoding nucleic acid encodes a cancer 
antigen. 

15. The vector of claim 14, wherein the cancer antigen is an antigen from a cancer 
selected from the group consisting of lymphomas (Hodgkins and non-Hodgkins), B cell 
lymphoma, T cell lymphoma, leukemias, myeloid leukemia, carcinomas, carcinomas of 
solid tissues, squamous cell carcinomas of mouth, throat, larynx, limg, head and neck, 
adenocarcinomas, sarcomas, gliomas, high grade gliomas, blastemas, neuroblastomas, 
plasmacytomas, histiocytomas,, melanomas, adenomas, hypoxic tumours, myelomas, 
AIDS-related lymphomas or sarcomas, metastatic cancers, mycosis ftmgoides, bladder 
cancer, brain cancer, nervous system cancer, head and neck cancer, kidney cancer, lung 
cancers such as small cell lung cancer ^d non-small cell lung cancer, 
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tieuroblastoina/glioblastoma, ovarian cancer, liver cancer, colon cancer, cervical 
cancer, cervical carcinoma, breast cancer, and epithelial cancer, renal cancer, 
genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck 
carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and 
rectal cancers, prostatic cancer, or pancreatic cancer. 

16. The cancer antigen of claim 15, wherein the antigen is selected from the list of 
cancer antigen consisting of mucin (Muc-1; a 20 amino acid core repeat for Muc-1 
glycoprotein, present on breast cancer cells and pancreatic cancer cells), the Ha-ras 
oncogene product, p53, cmrcino-embryonic antigen (CEA), the raf oncogene product, 
gpl00/pmell7, GD2, GD3, GM2, TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, 
tyrosinase, gp75, Melan-A/Mart-1, gplOO, HER2/neu, EBV-LMP 1 & 2, HPV-F4, 6, 7, 
prostate-specific antigen (PSA), HPV-16, MUM, alpha-fetoprotein (AFP), C017-1 A, 
GA733, gp72, p53, the ras oncogene product, HPV E7, Wilm's tumor antigen-1, 
telomerase, and melanoma gangliosides. 

17. A cell comprising the vector of claim 1 . 

18. A method of detecting the expression of the vector of claim I, comprising 
obtaining a sample containing the vector, contacting the sample with an aiitibody 
directed to antigen encoded by the iinmimizing construct, and detecting antigen bound, 
to the antibody, the presence of bound antigen indicating expression of the vector. 

19. The method of claim 18, wherein the sample is blood from a subject. 

20. The method of claim 18, wherein the sample is tissue biopsy from a subject. 

2 1 . A method of treating, a subject with a condition comprismg administering to the 
subject the vector of claim 1 . 

22. The method of claim 21, wherein xhe condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Herpes simplex viixis type- 1 , Herpes simplex virus type-2. 
Cytomegalovirus, EpsteiurBarr virus. Varicella-zoster virus, Iluman herpesyiirus .6, 
Hiunan heipesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis virus. 
Hepatitis A virus, Hepatitis B virus,. Hepatitis C virus, Hepatiti? D virus, Hepatitis E 
virus, Rhinoyiius, Coronaviras, Influenza virus A, Influenza virus B, Measles viiois, 
Polyomaviais, Himian Papilomavirus, Respiratory syncytial vims, Adenovirus, 
Coxsackie virus. Dengue virus, Miunps vims, Poliovirus, Rabies viriis, Rous sarcoma 
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viras, Yellow fever virus, Ebola virus, Marburg virus, Lassa fever virus, Eastern 
Equine Encephalitis virus, Japanese Encephalitis virus, St Louis Encephalitis virus, 
Murray Valley fever virus, West Nile virus. Rift Valley fever virus. Rotavirus A, 
Rotavirus B, Rotavirus C, Sindbis virus. Simian Immunodefipiency cirus, Hunoan T*- 
cell Leukemia virus type-1, Hantavirus; Rubella virus, Siroian Immunodeficiency virus. 
Human Immxmodeficiency virus type-1, and Human Immxmodeficiency virus type-2. 

23. The method of claim 21, wherein the condition is associated with bacterial 
infection, and the immimizing construct encodes an antigen from a bacterium selected 
from the group sonsisting of M tuberculosis, M bovis, M bovis strain BCG, BCG 
substrains, M avium, M intracellulare, M. africanum, M. kansasii, M marinum, M 
ulcerans, M avium subspecies paratuberculosis, Nocardia asteroides, ofh&r Nocardia 
species, Legionella pneumophila, oiiiGr Legionella species. Salmonella typhi, other 
Salmonella species. Shigella species. Yersinia pestis, Pasteurella haemolytica, 
Pasteurella multocida, other Pasteurella species, Actinobacillus pleuropneumoniae. 
Listeria monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, 
Cowdria rumiriantium, Chlamydia pneumoniae. Chlamydia trachomatis. Chlamydia 
psittaci, Coxiella bumetti, other Rickettsial species. Ehrlichia species. Staphylococcus 
aureus, Staphylococcus epidermidis. Streptococcus pyogenes, Streptococcus 
agalactiae, Bacillus anthracis, Escherichia coli, Vibrio cholerae, Campylobacter 
species, Neiserria meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other 
Pseudomonas species, Haemophilus influenzae, Haemophilus ducreyi, other 
Hemophilus species, Clostridium tetani, other Clostridium species. Yersinia 
enterolitica, and other Yersinia species. 

24. The method of claim 21, wherein the condition is associated with parasitic 
infection, and the immimizing construct encodes an antigen from a parasite selected 
from the group consistiag. of Toxoplasma gondii, Plasmodium falciparum, Plasmodium 
vivax, Plasmodium malariae, other Plasmodium species., Trypanosoma brucei^ 
Trypanosoma cruzi, Leishmania major, other Leishmania species.. Schistosoma 
mansoni, other Schistosoma species., and Entamoeba histolytica. 

25. The method of claim 21, wherein the condition is cancer, and the immunizing 
construct encodes a tumor antigen from the list consisting of hmnan epithelial cell 
mucin (Muc-1; a 20 amino acid core repeat for Muc-1 glycoprotein, present on breast 
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cancer cells and pancreatic cancer cells), the Ha-ras oncogene product, p53, carcino- 
embryonic antigen (CEA), the raf oncogene product, gpl00/pmell7, GD2, GD3, GM2, 
TF, sTn, MAGE-1, MAGE-3, BAGE, GAGE, tyrosinase, gp75, Melan-A/Mart-l, 
gplOO, HER2/neu, EBV-LMP 1 & 2, HPV-F4, 6, 7, prostate-specific antigen (PSA), 
HPV-16, MUM, alpha-fetoprotein (AFP), C017-1A, GA733, gp72, p53, the ras 
oncogene product, HPV E7, Wilm's tumor antigen- 1, telomerase, and melanoma 
gangliosides. 

26. A method of preventing a condition in a subject comprising administering to the 
subj ect the vector of claim 1 . 

27. The method of claim 26, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Herpes simplex virus type-1. Herpes simplex virus type-2, 
Cytomegalovirus, Epstein-Barr vims. Varicella-zoster virus. Human herpesvirus 6, 
Hiraian herpesvirus 7, Human herpesvirus 8, Variola virus. Vesicular stomatitis vims. 
Hepatitis A virus. Hepatitis B vims. Hepatitis C virus. Hepatitis D vims. Hepatitis E 
vims, Rhinovims, Coronavims, Influenza viras A, Influenza vims B, Measles vims, 
Polyomavirus, Human Papilomaviras, Respiratory syncytial virus. Adenovirus, 
Coxsackie vims. Dengue vims, Mxmips vims, Poliovims, Rabies virus, Rous sarcoma 
vims, Yellow fever vims, Ebola viras, Marburg vims, Lassa fever viras, Eastem 
Equine Encephalitis viras, Japanese Encephalitis viras, St. Louis Encephalitis viras, 
Murray Valley fever viras, West Nile viras, Rift Valley fever viras, Rotaviras A, 
Rotavims B, Rotaviras C, Sindbis viras, Simian Immunodeficiency ciras, Human T- 
cell Leukemia viras type-1, Hantaviras, Rubella viras, Simian Immunodeficiency viras. 
Human Immunodeficiency viras type-1, and Himian Immunodeficiency viras type-2. 

28. The method of claim 26, wherein the condition is associated with viral infection, 
and the immunizing constract encodes an antigen from a viras selected from the group 
consisting of M tuberculosis, M bovis, M. bovis strain BCG, BCG substmins, M 
avium^ M intracellularey M africanum, M. kansasii, M marinum, M ulcerans, M 
avium subspecies paratuberculosis, Nocardia asteroides^ oihcr Nocardia species, 
Legionella pneumophila^ ofh^r Legionella species. Salmonella typhi, other Salmonella 
species. Shigella species. Yersinia pestis, Pasteurella haemolytica, Pasteurella 
multocida, other Pasteurella species, Actinobacillus pleuropneumoniae, Listeria 
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monocytogenes. Listeria ivanovii. Brucella abortus, other Brucella species, Cowdria 
ruminantium. Chlamydia pneumoniae^ Chlamydia trachomatis. Chlamydia psittaci, 
Coxiella bumettiy oHxet Rickettsial species. Ehrlichia species. Staphylococcus aureus. 
Staphylococcus epidermidis, Streptococcus pyogenes. Streptococcus agalactiae. 
Bacillus anthracis, Escherichia coli, Vibrio cholerae, Campylobacter species, Neiserria 
meningitidis, Neiserria gonorrhea, Pseudomonas aeruginosa, other Pseudomonas 
species, Haemophilus influenzae^ Haemophilus ducreyi, other Hemophilus species, 
Clostridium tetani, other Clostridium species. Yersinia enterolitica, and other Yersinia 
species, 

29. The method of claim 28, wherein the condition is associated with viral infection, 
and the immunizing construct encodes an antigen from a virus selected from the group 
consisting of Toxoplasma gondii, Plasmodium falciparum, Plasmodium vivax, 
Plasmodium malariae, oQier Plasmodium species., Trypanosoma brucei, Trypanosoma 
cruzi, Leishmania major, other Leishmania species.. Schistosoma mansoni, other 
Schistosoma species., and Entamoeba histolytica. 

30. A method of making a model for use in screening for substances effective in 
treating or preventing a disease associated with an immimizing construct, comprising 
administering the vector of claim 1 to an animal capable of manifesting a characteristic 
of the immunizing construct. 

31. A model made in accordance with the method of claim 30. 

32. A method of screening for a substance effective in treating a disease associated 
with an immimizing construct comprising: a) administering the substance to the model 
of claim 31; and b) assaying for an change in the course of the disease as compared to 
an the course of the disease in a control subject; an improvement in the course of the 
disease in the presence of the substance identifies a substance that is effective in 
treatmg the disease. 

33. A method of screening for an immunizing construct effective in preventing a 
disease associated with an immunizing construct comprising: a) administering the 
vector of claim 1 to a subject; b) subjecting the subject to treatment that will induce the 
disease or characteristic of the disease; and c) assaying for an change in the course of 
the disease as compared to an the course of the disease in a control subject; an. 
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improvement in the course of the disease in the presence of the substance identifies a 
substance that is effective in preventing the disease. 

34. A method of screening for an immunizing construct effective in treating a disease 
effective in treating a disease associated with an immunizing construct comprising: a) 
subjecting tib.e subject to treatment that will induce the disease or characteristic of the 
disease; b) administering the vector of claim 1 to a subject; and c) assaying for an 
change in the course of the disease as compared to an the course of the disease in a 
control subject; an improvement in the course of the disease in the presence of the 
substance identifies a substance that is effective in treating the disease. 



35. 



The method of claims 21 



34, wherein the subject is a cell. 



36. 



37. 



38. 



The method of claims 21 



The method of claims 21 



The method of claims 21 



34, wherein the subject is a mouse. 
34, wherein the subject is a monkey, 
•34, wherein the subject is a human 
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SEQ. ID NO. I 



1 


CAGGTGGCAC 
GTCCACCGTG 


TTTTCGGGGA 
AAAAGCCCCT 


AATGTGCGCG 
TTACACGCGC 


GAACCCCTAT 
CTTGGGGATA 


TTGTTTATTT 
AACAAATAAA 


SI 


TTCTAAATAC 
AAGATTTATG 


ATTCAAATAT 
TAAGTTTATA 


GTATCCGCTO 
CATAGGCGAG 


ATGAGACAAT 
TACTCTGTTA 


AACCCTGATA 
TTGGGACTAT 


101 


AATGCTTCAA 
TTACGAAGTT 


TAATATTGAA 
ATTATAACTT 


AAAGGAAGAG 
TTTCCTTCTC 


TATGAGTATT 
ATACTCATAA 


CAACATTTCC 
GTTGTAAAGG 


151 


GTGTCGCCCT 
CACAGCGGGA 


TATTCCCTTT 
ATAAGGGAAA 


TTTGCGGCAT 
AAAC6CCGTA 


TTTGCCTTCC 
AAACGGAAGG 


TGTTTTTGCT 
ACAAAAACGA 


201 


CACCCAGAAA 
GTGGGTCTTT 


CGCTGGTGAA 
GCGACCACTT 


AGTAAAAGAT 
TCATTTTCTA 


GCTGAAGATC 
CGACTTCTAG 


AGTTGGGTGC 
TCAACCCACG 


251 


ACGAGTGGGT 
TGCTCACCCA 


TACAtCGAAC 
ATGTAGCTTG 


TGGATCTCAA 
ACCTAGAGTT 


CAGCGGTAAG 
GTCGCCATTC 


ATCCTTGAGA 
TAGGAACTCT 


301 


GTTTTCGCCC 
CAAAAGCGGG 


CGAAGAACGT 
GCTTCTTGCA 


TTTCCAATGA 
AAAGGTTACT 


TGAGCACTTT 
ACTCGTGAAA 


TAAAGTTCTG 
ATTTCAAGAC 


351 


CTATGTGGCG 
GATACACCGC 


CGGTATTATC 
GCCATAATAG 


CCGTGTTGAC 
GGCACAACTG 


GCCGGGCAAG 
CGGCCCGTTC 


AGCAACTCGG 
TCGTTGA1GCC 


401 


TCGCCGCATA 
AGCGGCGTAT 


CACTATTCTC 
GTGATAAGAG 


AGAATGACTT 
TCTTACTGAA 


GGT^'GAGTAC 
CCAACTCATG 


TCACCAGTCA 
AGTGGTCAGT 


451 


CAGAAAAGCA 
GTCTTTTCGT 


TCTTACGGAT 
AGAATGCCTA 


GGCATGACAG 
CCGTACTGTC 


TAAGAGAATT 
ATTCTCTTAA 


ATGCAGTGCT 
TACGTCACGA 


501 


GCCATAACCA 
CGGTATTGGT 


TGAGTGATAA 
ACTCACTATT 


CACTGCGGCC 
GTGACGCCGG 


AACTTACTTC 
TTGAATGAAG 


TGACAACGAT 
ACTGTTGCTA 


551 


CGGAGGACCG 
GCCTCCTGGC 


AAGGAGCTAA 
TTCCTCGATT 


CCGCTTTTTT 
GGCGAAAAAA 


GCACAACAT6 
CGTGTTGTAC 


GGGGATCATG 
CCCCTAGTAC 


601 


TAACTCGCCT 
ATTGAGCGGA 


TGATCGTTGG 
ACTA6CAACC 


GAACCGGAGC 
CTTGGCCTCG 


TGAATGAAGC 
ACTTACTTCG 


CATACCAAAC 
GTATGGTTTG 


651 


GACGAGCGTG 
CTGCTCGCAC 


ACACCACGAT 
TGTGGTGCTA 


GCCTGCAGCA 
CGGACGTC6T 


ATGGCAACAA 
TACCGTTGTT 


CGTTGCGCAA 
GCAACGCGTT 


701 


ACTATTAACT 
TGATAATTGA 


GGCGAACTAC 
CCGCTTGATG 


TTACTCTAGC 
AATGAGATCG 


TTCCCGGCAA 
AAGGGCCGTT 


CAATTAATAG 
GTTAATTATC 


751 


ACTGGATGGA 
TGACCTACCT 


GGCGGATAAA 
CCGCCTATTT 


GTTGCAGGAC 
CAACGTCCTG 


CACTTCTGCG 
GTGAAGACGC 


CTCGGCCCTT 
GAGCCGGGAA 


801 


CCGGCTGGCT 
GGCCGACCGA 


GGTTTATTGC 
CCAAATAACG 


TGATAAATCT 
ACTATTTAGA 


GGAGCCGGTG 
CCTCGGCCAC 


AGCGTGGGTC 
TCGCACCCAG 


851 


TCGCGGTATC 
AGCGCCATAG 


ATTGCAGCAC 
TAACGTCGTG 


TGGGGCC^GA 
ACCCCGGTCT 


TCCATTCGGG 
AGGTAAGCCC 


• AGGGCATAGC 
TCCCGTATCG 


901 


ATCAATAGAT 
TAGTTATCTA 


GTGCTGCCCC 
CACGACGGGG 


TCAGTCCGTT 
AGTCAGGCAA 


6ATACCTACT 
CTATGGATGA 


TGGTAAGCCC 
ACCATTCGGG 
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951 


TCCCGTATCG 
A6G6CATA6C 


TAGTTATCTA 
ATCAATAGAT 


cacgacgggg 
gtgctgcccc 


AGTCAGGCAA' CTATGGATGA 
TCAGTCCGTT GATACCTACT 


1001 


ACGAAATAGA 
TGCTTTATCT 


CAGATCGCTG 
GTCTAGCGAC' 


agataggtgc 
tctatccacg 


CTCACTGATT 
GAGTGACTAA 


AAGCATTGGT 
TTCGTAACCA 


1051 


AACT6TCAGA 
TTGACAGTCT 


CCAAGTTTAC 
GGTTCAAATG 


TCATATATAC 
AGTATATATG 


TTTAGATTGA 
AAATCTAACT 


TTTAAAACTT 
AAATTTTGAA 


1101 


CATTTTTAAT 
GTAAAAATTA 


TTAAAAGGAT 
AATTTTCCTA 


CTAGGTGAAG 
GATCCACTTC 


ATC,CTTTTTG 
TAQGAAAAAC 


ATAATCTCAT 
TATTAGAGTA 


1151 


GACCAAAATC 
CTGGTTTTAG 


CCTTAACGTG 
GGAATTGCAC 


AGTTTTCGTT 
TCAAAAGCAA 


CCACTGAGCG 
GGTGACTCGC 


TCAGACCCCG 
AGTCTGGGGC 


1201 


TAGAAAAGAT 
ATCTTTTCTA 


CAAAGGATCT 
GTTTCCTAGA 


TCTTGAGATC 
AGAACTCTAG 


CTTTTTTTCT 
GAAAAAAAGA 


GCGCGTAATC 
CGCGCATTAG 


12fal ■ 


TGCTGCTTGC 
ACGACGAACG 


AAACAAAAAA 
TTTGTTTTTT 


ACCACCGCTA 
TGGTGGCGAT 


CCAGCGGTGG 
GGTCGCCACC 


TTTGTTTGCC 
AAACAAACGG 


13 0.1 


GGATCAAGAG 
CCTA'GTTCTC 


CTACCAACTC 
GATGGTTGAG 


TTTTTCCGAA 
AAAAAGGCTT 


GGTAACTGGC 
CCATTGACCG 


TTCAGCAGAG 
AAGTCGTCTC 


1351 


. CGCAGATACC 
GCGTCTATGG 


AAATACTGTC 
TTTATGACAG 


CTTCTAGTGT 
GAAGATCACA 


AGCCGTAGTT 
TCGGCATCAA 


AGGCCACCAC 
TCCGGTGGTG 


1401 


TTCAAGAACT 
AAGTTCTTGA 


CTGTAGCACC 
GACATCGTGG 


GCCTACATAC 
CGGATGTATG 


CTCGCTCTGC 
GAGCGAGACG 


TAATCCTGTT 
ATTAGGACAA 


14 51 


ACCAGTGGCT 
TGGTCACCGA 


GCTGCCAGTG 
CGACGGTCAC 


GCGATAAGTC 
CGCTATTCAG 


GTGTCTTACC 
CACAGAATGG 


GGGTTGGACT 
CCCAACCTGA 


1501 


CAAGACGATA 
GTTCTGCTAT 


GTTACCGGAT 
CAATGGCCTA 


AAGGGGCAGC 
TTCCGCGTCG 


GGTCGGGCTG 
CCAGCCCGAC 


AACGGGGGGT 
TTGCCCCCCA 


1551 


TCGTGCACAC 
AGCACGTGTG 


AGCCCAGCTT 
TCGGGTCGAA 


GGAGCGAACG 
CCTCGCTTGC 


ACCTACACCG 
TGGATGTGGC 


AACTGAGATA 
TTGACTCTAT 


1601 


CCTACAGCGT 
GGATGTCGCA 


GAGCTATGAG 
CTCGATACTC 


AAAGCGCCAC 
TTTCGCGGTG 


GCTTCCCGAA 
CGAAGGGCTT 


GGGAGAAAGG 
CCCTCTTTCC 


1651 


CGGACAGGTA 
GCCTGTCCAT 


TCCGGTAAGC 
AGGCCATTCG 


GGCAGGGTCG 
CCGTCCCAGC 


GAACAGGAGA 
CTTGTCCTCT 


GCGCACGAGG 
CGCGTGCTCC 


1701 


GAGCTTCCAG 
CTCGAAGGTC 


GGGGAAACGC 
CCCCTTTGCG 


CTGGTATCTT 
GACCATAGAA 


TATAGTCCTG 
ATATCAGGAC 


TCGGGTTTCG 
AGCCCAAAGC 


1751 


CCACCTCTGA 
GGTGGAGACT 


CTTGAGCGTC 
GAACTCGCAG 


GATTTTTGTG 
CTAAAAACAC 


ATGCTCGTCA 
TACGAGCAGT 


GGGGGGCGGA 
CCCCCCGCCT 


1801 


GCCTATGGAA AAACGCCAGC 
CGGATACCTT TTTGCGGTCG 


AACGCGGCCT 
TTGCGCCGGA 


TTTTACGGTT 
AAAATGCCAA 


CCTGGCCTTT 
GGACCGGAAA 


1851 


TGCTGGCCTT 
ACGACCGGAA 


ttgctcacat 
aacgagtg'ta 


GTTCTTTCCT 
CAAGAAAGGA 


GCGTTATCCC 
CGCAATAG6G 


CTGATTCTGT 
GACTAAGACA 
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2851 


CAATCAACAA 
GTTAGTTGTT 


AACAATGATG 
TTGTTACTAC 


ATGTAATCAT 
TACATTAGTA 


AAGGAAGTAG 
TTCCTTCATC 


TTTAAAATAG 
AAATTTTATC 


2901 


GTTAATAAGT 
CAATTATTCA 


TTATTAGTTA 
AATAATCAAT 


TATAGAAAAT 
ATATCTTTTA 


AATATAGGAT 
TTATATCCTA 


AAAGTATAAG 
TTTCATATTC 


2S'51 


GATTAAGGTA 
CTAATTCCAT 


TGAGGTGTGT 
ACTCCACACA 


GGCTCAACAC 
CCGAGTTGTG 


GTAGGGTGAC 
CATCCCACTG 


AAGAAAATCT 
TTCTTTTAGA 


3001 


ACTGTAATAG 
TGACATTATC 


GACACAACAC 
CTGTGTTGTG 


CTCTAAAGTT 
GAGATTTCAA 


GCCCGTGGGA 
CGGGCACCCT 


AGGTGAAGTG 
TCCACTTCAC 


30S1 


AGATCGAATC 
TCTAGCTTAG 


TTTCCTTAAC 
AAAGGAATTG 


GCAGACAGCT 
CGTCTGTCGA 


TTTTATCCAC 
AAAATAGGTG 


TAGGGATAAT 
ATCCCTATTA 


3101 


GTTTTAAGGA 
CAAAATTCCT 


'atactatagt 
tatgatatca 


AATAGATTGA 
TTATCTAACT 


TAGTTTTAAC 
ATCAAAATTG 


AATGATGGAA 
TTACTACCTT 


3151 


ATAGTATATA 
TATCATATAT 


AGGATAGTTT 
TCCTATCAAA 


CTAGATTGTA 
GATCTAACAT 


CGGGAGCTCT 
GCCCTCGAGA 


TCACTACTCG 
AGTGATGAGC 


3201 


CTGCGTCGAG 
GACGCAGCTC 


AGTGTACGAG 
TCACATGCTC 


ACTCTCCAGG 
TGAGAGGTCC 


TTTGGTAAGA 
AAACCATTCT 


AATATTTTAT 
TTATAAAATA 


3251 


ATTGTTATAA 
TAACAATATT 


TGTTACTATG 
ACAATGATAC 


ATCCATTAAC 
TAGGTAATTG 


ACTCTGCTTA 
TGAGACGAAT 


TAGATTGTAA 
AXCTAACATT 


3301* 


. GGGTGATTGC 
CCCACTAACG 


AATGCTTTCT 
TTACGAAAGA 


GCATAAAACT 
CGTATTTTGA 


TTGGTTTTCT 
AACCAAAAGA 


T6TTAATCAA 
ACAATTAGTT 


3351 


TAAACCGACT 
ATTTGGCTGA 


TGATTCGAGA 
ACTAAGCTCT 


ACCTACTCAT 
TGGATGAGTA 


ATATTATTGT 
TATAATAACA 


CTCTTTTATA 
GAGAAAATAT 


3401 


CTTTATTAAG 
GAAATAATTC 


TAAAAGGATT 
ATTTTCCTAA 


TGTATATTAG 
ACATATAATC 


CCTTGCTAAG 
GGAACGATTC 


GGAGACATCT 
CCTCTGTAGA 


3451 


AGTGATATAA 
TCACTATATT 


GTGTGAACTA 
CACACTTGAT 


CACTTATCTT 
GTGAATAGAA 


AAATGATGTA 
TTTACTACAT 


ACTCCTTAGG 
TGAGGAATCC 


3501 


ATAATCAATA 
T ATTAG TT AT 


TACAAAATTC 
ATGTTTTAAG 


CATGACAATT 
GTACTGTTAA 


GGCGCCCAAC 
CCGCGGGTTG 


GTGGGGCTCG 
CACCCCGAGC 


3551 


AATATAAGTC 
TTATATTCAG 


GGGTTTATTT 
CCCAAATAAA 


GTAAATTATC 
CATTTAATAG 


CCTAGGGACC 
GGATCCCTGG 


TCCGAGCATA 
AGGCTCGTAT 


3601 . 


GCGGGAGGCA 
CGCCCTCCGT 


TATAAAAGCC 
ATATTTTCGG 


AATAGACAAT 
TTATCTGTTA 


GGCTTCAGGA 
CCGAAGTCCT 


AGTAATGTTG 
TCATTACAAC 


3651 


AAGAATATGA 
TTCTTATACT 


ACTTGATGTT 
TGAACTACAA 


GAAGCTCTGG 
CTTCGAGACC 


TTGTAATTTT 
AACATTAAAA 


AAGAGATAGA 
TTCTCTATCT 


3701 


AATATACCAA 
TTATATGGTT 


GAAATCCTTT 
CTTTAGGAAA 


ACATGGAGAA 
TGTACCTCTT 


GTTATAGGTC 
CAATATCCAG 


TTCGCCTTAC 
AAGCGGAATG 


3751 


TGAAGGATGG 
ACTTCCTACC 


T6GGGACAAA 
ACCCCTGTTT 


TTGAGAGATT 
AACTCTCTAA 


TCAGAT.GGTA 
AGTCTACCAT 


C6TCTAATAT 
GCAGATTATA 
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3 8 01 .TAdAAGATGA TGATAATGAA CCTTTACAGA GACCTA6ATA TGAGGTAATA 
ATGTTCTACT ACTATTACTT GGAAATGTCT CTGGATCTAT ACTCCATTAT 



3 851 CAACGAGCTG TAAACCCTCA TACAATGTTT ATGATATCAG GACCATTAGC* 
GTTGCTCGAC ATTTGGGAGT ATGTTACAAA TACTATAGTC CTGGTAATCG 



3 901 TGAACTTCAA TTAGCCTTTC AGGATTTAGA TTTACCTGAA GGTCCATTGA 
A.CTTGAAGTT AATCGGAAAG TCCTAAATCT AAATGGACTT CCAGGTAACT 



3 9 51 GGTTTGGTCC ATTGGCAAAT GGACATTAT6 TTCAAGGAGA TCCTTATAGT 
CCAAACCAGG TAACCGTTTA CCTGTAATAC AAGTTCCTCT AGGAATATCA 



4 0 01 AGTTCTTACA GACCAGTAAC AATGGCCGAA ACAGCCCAAA TGACTAGAGA 
TCAAGAATGT CTGGTCATTG TTACCGGCTT TGTCGGGTTT ACTGATCTCT 



4 051 TGAACTGGAA GATGTTdTTA ATACTCAAAG TGAAATAGAA ATTCAAATGA 
ACTTGACCTT CTACAAGAAT TATGAGTTTC ACTTTATCTT TAAGTTTACT 



4101 TAAATTTATT GGAGTTGTAT GAAGTTGAAA CTAGAGCTCT TAGAAGACAA 
ATTTAAATAA .CCTCAACATA CTTCAACTTT GATCTCGAGA ATCTTCTGTT 



4151 TTAGCTGAGA GATCTAGTAC AGGGCAAGGA GGAATATCCC CAGGAGCTCC 
AATCGACTCT CTAGATCATG TCCCGTTCCT CCTTATAGGG GTCCTCGAGG' 



4 2 01 TCGTTCTCGA CCACCAGTAA GCAGCTTCTC AGGGTTACCA A6TTTGCCCT 
AGCAAGAGCT GGTGGTCATT CGTCGAAGAG TCCCAATGGT TCAAACGGGA 



4251 CTATACCTGG GATTCATCCC AGGGCACCTT CACCTCCAAG GGCAACTTCT 
GATATGGACC CTAAGTAGGG TCCCGTGGAA GTGGAGGTTC CCGTTGAAGA 



43 01 ACTCCCGGAA ATATTCCTTG GAGTTTAGGA GATGATAGCC CACCTTCATC 
TGAGGGCCTT TATAAGGAAC CTCAAATCCT CTACTATCGG GTGGAAGTAG 



4 3 51 TAGTTTTCCT GGACCCTCTC AACCTCGTGT TTCTTTCCAT CCGGGAAATC 
ATCAAAAGGA CCTGGGAGAG TTGGAGCACA AAGAAAGGTA GGCCCTTTAG 



4 401 CTTTTGTTGA AGAAGAAGGT CATAGACCTA GATCCCAGTC TAGAGAAAGG 
GAAAACAACT TCTTCTTCCA GTATCTGGAT CTAGGGTCAG ATCTCTTTCC 



4 4 51 T^GAAGAGAAA TTCTTCCTGC TCCTGTACCG TCAGCACCTC CTATGATTCA 
TCTTCTCTTT AAGAAGGACG AGGACATGGC AGTCGTGGAG GATACTAAGT 



4 501 GTATATACCA GTACCACCTC CACCACCGAT TGGCACGGTT ATACCTATTC 
CATATATGGT CATGGTGGJlG GTGGTGGCTA ACCGTGCCAA TATGGATAAG 



4 5 51 AGCATATCAG ATCTGTAACT GGAGAGCCTC CTAGAAACCC AAGAGAAATA 
TCGTATAGTC TAGACATTGA CCTCTCGGAG* GATCTTTGGG TTCTCTTTAT 



4 601 CCAATTTGGC TAGGACGAAA TGCTCCTGCT ATAGATGGAG TGTTCCCTGT 
GGTTAAACCG ATCCTGCTTT ACGAGGACGA TATCTACCTC ACAAGGGACA 



4 651 TACAACACCG GATCTAAGAT GCAGAATAAT TAATGCTATA CTAGGAGGAA 
ATGTTGTGGC CTAGATTCTA CGTCTTATTA ATTACGATAT GATCCTCCTT 



4 7 01 ATATTGGGCT ATCATTAACC CCTGGAGACT GTTTAACATG GGACTCAGCA 
TATAACCCGA TAGTAATTGG GGACCTCTGA CAAATTGTAC CCTGAGTCGT 
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4751 


GTAGCCACCT 
CATCGGTGGA 


TATTTATTAG 
ATAAATAATC 


AACCCATGGA 
TTGGGTACCT 


ACTTTTCCAA 
TGAAAAGGTT 


TGCATCAGCT 
ACGTAGTCGA 


4801 


TGGAAATGTA 
ACCTTTACAT 


ATAAAAGGCA 
TATTTTCCGT 


TAGTTGATCA 
ATCAACTAGT 


AGAAGGAGTG 
TCTTCCTCAC 


GCAACAGCAT 
CGTTGTCGTA 


4851 


ATACTTTGGG 
TATGAAACCC 


AATGATGCTT 
TTACTACGAA 


TCTGGACAAA 
AGACCTGTTT 


ATTATCAATT .AGTTTCTGGA 
TAATAGTTAA TCAAAGACCT 


4901 


ATAATTAGAG 
T ATT AAT CTC 


GATATTTGCC 
CTATAAACGG 


TGGACAAGCT 
ACCTGTTC6A 


GTAGTAACTG 
CATCATTGAC 


CATTACAACA 
GTAATGTTGT 


4951 


GCGTTTAGAC 
CGCAAATCTG 


CAAGAAATAG 
GTTCTTTATC 


ATAATCAAAC 
TATTAGTTTG 


AAGAGCAGAG 
TTCTCGTCTC 


ACTTTTATTC 
TGAAAATAAG 


5001 


AACATCTAAA 
TTGTAGATTT 


TGCTGTATAT 
ACGACATATA 


GAAATTTTAG 
CTTTAAAATC 


GCCTTAATGC 
CGGAATTACG 


CAGAGGACAA 
GTCTCCTGTT 


5051 


' AGTATACGTG 
TCATATGCAC 


CTTCAGTGAC 
GAAGTCACTG 


TCCTCAACCC 
AGGAGTTGGG 


CGACCATCCA 
GCTGGTAGGT 


GAGGTAGAGG 
CTCCATCTCC 


5101 


TCGAGGTCAA 
AGCTCCAGTT 


AATACTTCTA 
TTATGAAGAT 


GACCCTCT'CA 
CTGGGAGAGT 


AGGACCAGCT 
TCCTGGTCGA 


AATAGCGGGC 
TTATCGCCCf^ 


5151 


GGGGACGACA 
CCtCTGCTGT 


GCGCCCTGCT 
CGCGGGACGA 


TCTGGTCAAA 
AGACCAGTTT 


GCAACAGAGG 
CGTTGTCTCC 


ATCTAGTACT 
TAGATCATGA 


5201 


CAGAATCAAA 
GTCTTAGTTT 


ATCAAGATAA 
TAGTTCTATT 


TTTAAATCAA 
AAATTTAGTT 


GGAGGATATA 
CCTCGTATAT 


ATCTTCGACC 
TAGAAGCTGG 


3251 


CCGTACTTAC 
GGCATGAATG 


CAACCTCAAA 
GTTGGAGTTT 


GGTACGGAGG 
CCATGCCTCC 


AGGACGTGGA 
TCCTGCACCT 


CGAAGATGr,A 
GCTTCTACCT 


5 SOI 


ACGATAATAC 
TGCTATTATG 


TAACAATCAA 
ATTGTTAGTT 


GAGTCCAGAC 
CTCAGGTCTG 


CAT C AG AT C A 
GTAGTCTAQT 


AGGTTCTCAA 
TCCAAGAGT7 


5351 


ACTCCTAGGC 
TGAGGATCCG 


CAAATCAAGC 
GTTTAGT T CG 


AGGCTCTGGG 
TCCGAGACCC 


GTGCGTGGCA 
CACGCACCGT 


ATCAGTCACA 
TAGTCAGTGT 


5401 


AACTCCCAGA 
TTGAGGGTCT 


ccagctgctg 
ggtcgaCtgac 


GTCGCGGAGG 
CAGCGCCTCC 


AAGAGGTAAC 
TTCTCCATTG 


CACAACCGAA 
GTGTTGGCTT 


54 51 


ACCAACGATC 
TGGTTGCTAG 


atccggtgct 
taggccacga 


GGTGACTCAC 
CCACTGAGTG 


GCGCTGTCAA 
CGCGACAGTT 


TACCGTGACA 
ATGGCACTGT 


5501 


CAGAGTGCCA 
GTCTCACGGT 


CGTCCTCCAC 
GCAGGAGGTG 


AGATGAATCC 
TCTACTTAGG 


TCTTCAGCTG 
AGAAGTCGAC 


TTACAGCCGC 
AATGTCGGCG 


5551 


TTCCGGCGGA 
AAGGCCGCCT 


GATCAAAGGG 
CTAGTTTCCC 


ACTAAATTGT 
TGATTTAACA 


TAGCCCACTG 
ATCGGGTGAC 


GGATTCAGGG 
CCTAAGTCCC 


5601 


GCAACAATAA 
CGTTGTTATT 


CTTGTATTCC 
GAACATAAGG 


TGAAAGTTTT 
ACTTTCAAAA 


ttagaIagatg 
aatcttctac 


AACAACCTAT 
T-TGTTGGATA 


5651 


TAAAAAGACT 
ATTTTTCTGA 


TTAATAAAAA 
AATTATTTTT 


CAATTCATG6 
GTTAAGTACC 


AGAAAAACAA 
TCTTTTTGTT 


CAAAATGTTT 
GTTTTACAAA 
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5701 


ATTATGTAAC 
TAATACATTG 


CTTTAAAGTT 
GAAATTTCAA 


AAAGGAAGAA 
TTTCCTTCTT 


AAGTGGAAGC 
TTCACCTTCG 


AGAAGTGATA 
TCTTCACTAT 


5751 


GCTTCTCCTT 
CGAAGA6GAA 


ATGAGTATAT 
TACT CAT AT A 


TTTGCTGTCG 
AAA.CGACAGC 


CCAACAGATG 
GGTTGTCTAC 


TTCCTTGGTT 
AAGGAACCAA 


5801 


AACACAGCAA 
TTGTGTCGTT 


CCACTTCAGT 
GGTGAAGTCA 


TAACAATTTT 
ATTGTTAAAA 


AGTTCCTCTT 
TCAAGGAGAA 


CAAGAATATC 
GTTCTTATAG 


5651 


AAGA6AAAAT 
TTCTCTTTTA 


CTTAAGTAAG 
GAATTCATTC 


ACTGCTCTTC 
TGACGAGAAG 


CAGAAGATCA 
GTCTTCTAGT 


AAAACAACAA 
TTTTGTTGTT 


5901 


TTAAAAACCT 
AATTTTTGOA 


TGTTTGTCAA 
ACAAACAGTT 


GTATGACAAT 
CATACTGTTA 


CTATGGCAAC 
GATACCGTTG 


ATTGGGAAAA 
TAACCCTTTT 


5951 


TCAAGTCGGG 
AGTTCAGCCC 


CATAGAAAAA 
GTATCTTTTT 


TTAGGCCACA 
AATCCGGTGT 


TAATATAGCA 
ATTATATCGT 


ACTGGTGATT 
TGACCACTAA 


6001 


ATCCTCCTCG 
TAGGAGGAGC 


CCCTCAAAAA 
GGGAGTTTTT 


CAATATCCTA 
GTTATAGGAT 


TTAATCCTAA 
AATTAGGATT 


GGCAAAGCCT 
CCGTTTCGGA 


6051 


AGTATACTVAA 
TCATATGTTT 


TTGTAATAGA 
AACATTATCT 


TGACTTATTG 
ACTGAATAAC 


AAACAAGGGG 
TTTGT??CCCC 


TGTTAACGCC 
ACAATTGCGG 


6101 


TCAAAATAGT 
AGTTTTATCA 


ACAATGAATA 
TGTTACTTAT 


CACCAGTGTA 
GTGGTCACAT 


TCCTGTTCCT 
AGGACAAGGA 


AAACCAGATG 
TTTGGTCTAC 


6151 


GAAGGTGGAG 
CTTCCACCTC 


AATGGTATTA 
TTACCATAAT 


GATTATAGAG 
CTAATATCTC 


AAGTAAATAA 
TTCATTTATT 


AACTATTCCA 
TTGATAAGGT 


6201 


TTAACAGCTG 
AATTGTCGAC 


CCCAAAACCA 
GGGTTTTGGT 


ACACTCTGCT 
TGTGAGACGA 


GGTATTTTAG 
CCATAAAATC 


CTACTATTGT 
GATGATAACA 


6251 


TAGACAAAAA 
ATCTGTTTTT 


TATAAAACTA 
ATATTTT6AT 


CCTTAGATTT 
GGAATCTAAA 


AGCTAATGGA. 
TCGATTACCT. 


TTTTGGGCTC 
AAAACCCGAG 


6301 


ATCCTATTAC 
TAGGATAATG 


ACCAGAATCT TATTGGTTAA 
TGGTCTTAGA ATAACCAATT 


CAGCATTTAC 
GTCGTAAATG 


CTGGCAAGGT 
GACCGTTCCA . 


6351 


AAACAGTATT 
TTTGTCATAA 


GTTGGACACG 
CAACCTGTGC 


TCTTCCTCAA 
AGAAGGAGTT 


GGATTTTTAA 
CCTAAAAATT 


ATAGTCCAGC 
TATCAGGTCG 


6401 


ATTGTTTACA 
TAACAAATGT 


GCTGATGTAG 
CGACTACATC 


TAGATTTACT 
ATCTAAATGA 


AAAAGAAATC 
TTTTCTTTAG 


CCTAATGTAG 
GGATTACATG 


6451 


AAGTGTATGT 
TTCACATACA 


TGATGATATA 
ACTACTATAT 


TATTTAAGCC 
ATAAATTCGG 


ATGATGATCC 
TACTACTAGG 


TAAAGAGCAT 
ATTTCTCGTA 


6501 


GTTCAACAAT 
CAAGTTGTTA 


TAGAAAAAGT 
ATCTTTTTCA 


GTTTCAAATT 
CAAAGTTTAA 


TTACTACAGG 
AATGATGTCC 


CAGGATATGT 
GTCCTATACA 


6551 


AGTATCTTTG 
TCATAGAAAC 


AAAAAATCAG 
TTTTTTAGTC 


AAATTGGTCA 
TTTAACCAGT 


AAAAACTGTA 
TTTTTGACAT 


GAATTTTTAG 
CTTAAAAATC 


6601 


GATTTAATAT 
CTAAATTATA 


TACTAAAGAA 
ATGATTTCTT 


GGTCGTGGCC 
CCAGCACCGG 


TAACAGACAC 
ATTGTCTGTG 


TTTTAAAACA 
AAAATTTTGT 
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6651 


AAACTGTTAA 
TTTGACAATT 


ATATTACTCC 
TATAATGAGG 


TCCAAAAGAC 
AGGTTTTCTG 


TTAAAGCAAT. 
AATTTCGTTA 


TACAAAGCAT 
ATGTTTCGTA 


6701 


ATTAGGATTG 
TAATCCTAAC 


TTAAATTTTG 
AATTTAAAAC 


CTAGAAATTT 
GATCTTTAAA 


TATACCTAAT 
ATATGGATTA 


TTTGCTGAAC 
AAACGACTTG 


6751 


TGGTACAACC 
ACCATGTTGG 


ATTATACAAT 
TAATATGTTA 


TTAATAGCCT 
AATTATCGGA 


CAGCAAAAGG 
GTCGTTTTCC 


CAAATATATT 
GTTTATATAA 


6801 


GAGTGGTCTG 
CTCACCAGAC 


AAGAAAATAC 
TTCTTTTATG 


TAAACAATTA 
ATTTGTTAAT 


AATATGGTAA 
TTATACCATT 


TAGAAGCATT 
ATCTTCGTAA 


6851 


AAACACTGCC 
TTTGTGACGG 


TCTAATTTAG 
AGATTAAATC 


AAGAAAGGTT 
TTCTTTCCAA 


ACCAGAACAG 
TGGTCTTGTC 


AGACTGGTAA 
TCTGACCATT 


6901 


TTAAAGTCAA 
AATTTCAGTT 


TACTTCTCCA 
ATGAAGAGGT 


TCAGCAGGAT 
AGTCGTCCTA 


ATGTAAGATA 
TACATTCTAT 


TTATAATGAG 
AATATTACTC 


6951 


ACTGGTAAAA 
TGACCATTTT 


AGCCTATTAT 
TCGGATAATA. 


GTACCTAAAT 
CATGGATTTA 


TAT6TGTTTT 
ATACACAAAA 


CCAAAGCAGA 
GGTTTCGTCT 


7001 


ATTAAAATTT 
TAATTTTAAA 


TCTATGTTAG 
AGATACAATC 


AAAAACTATT 
TTTTTGATAA 


AACTACAATG 
TTGATGTTAC 


CACAAAGCCT 
GT6TTTCGGA 


7051 


TAATTAAGGC TATGG^^^tTTG 6CCATGGGAC AAGAAATATT AGTTTATAGT ' 
ATTAATTCCG ATJ^rCf&TAAAC CGGTACCCTG TTCTTTATAA TCAAATATCA 


7101 


CCCATTGTAT 
GGGTAACATA 


CTATGACTAA 
GATACTGATT 


AATACAAAAA 
TTATGTTTTT 


ACTCCACTAC 
TGAGGTGATG 


CAGAAAGAAA 
GTCTTTCTTT 


7151 


AGCTTTACCC 
TCGAAATGGG 


AT'CAGATGGA 
TAATCTACCT 


TAACATGGAT 
ATTGTACCTA 


GACTTATTTA 
CTGAATAAAT 


GAAGATCCAA 
CTTCTAGGTT 


7201 


GAATCCAATT 
CTTAGGTTAA 


TCATTATGAT 
AGTAA.TACTA 


AAAACCTTAC' 
TTTTGGAATG 


CAGAACTTAA 
GTCTTGAATT 


GCATATTCCA 
CGTATAAGGT 


7251 


GATGTATATA 
CTACATATAT 


CATCTAGTCA 
GTAGATCAGT 


GTCTCCTGTT 
CAGAGGACAA 


AAACATCCTT 
TTTGTAGGAA 


CTCAATATGA 
GAGTTATACT 


7301 


AGGAGTGTTT 
TCCTCACAAA 


TATACTGATG 
ATATGACTAC 


GCTCGGCCAT 
CGAGCCGGTA 


CAAAAGTCCT 
GTTTTCAGGA 


GATCCTACAA 
CTAGGATGTT 


7351 


AAAGCAATAA 
TTTCGTTATT 


TGCTGGCATG 
ACGACCGTAC 


GGAATAGTAC 
CCTTATCATG 


ATGCCACATA 
TACGGTGTAT 


CAAACCTGAA 
GTTTGGACTT 


7401 


TATCAAGTTT 
ATAGTTCAAA 


TGAATCAATG 
ACTTAGTTAC 


GTCAATACCA 
CAGTTATGGT 


CTAGGTAATC 
GATCCATTAG 


ATACTGCTCA 
TATGACGAGT 


7451 


GATGGCTGAA 
CTACCGACTT 


ATAGCTGCAG 
TATCGACGTC 


TTGAATTTGC 
AACTTAAACG 


CTGTAAAAAA 
GACATTTTTT 


GCTTTAAAAA 
CGAAATTTTT 


7501 


TACCTGGTCC 
ATGGACCAGG 


TGTATTAGTT 
ACATAATCAA 


ATAACTGATA 
TATTGArtAT 


GTTTCTATGT 
CAAAGATACA 


AGCAGAAAGT 
TCGTCTTTCA 


7551 


GCTAATAAAG 
CGATTATTTC 


AATTACCATA 
TTAATGGTAT 


CTGGAAATCT 
GACCTTTAGA 


AATGGGTTTG 
TTACCCAAAC 


TTAATAATAA 
AATTATTATT 
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7601 GAAAAAGCCT- CTTAAACATA TCTCCAAATG GAAATCTATT GCTGAGTGTT 
CTTTTTCGGA GAATTTGTAT AGAGGTTTAC CTTTAGATAA CGACTCACAA 



7651 


TATCTATGAA 
ATAGATACTT 


. ACCAGACATT 
TGGTCTGTAA 


ACTATTCAAC 
. TGATAAGTTG 


: ATGAAAAAGG 
TACTTTTTCC 


'r CATCAGCCTA 
; GTAGTCGGAT 


7701 


CAAATACCAG 
GTTTATGGTC 


TATTCATACT 
ATAAGTATGA 


GAAAGGCAAT 
CTTTCCGTTA 


GCCCTAGCAG ATAAGCTTGC 
CGGGATCGTC .TATTCGAACG 


7751 


CACCCAAGGA 
GTGGGTTCCT 


AGTTATGTGG 
TCAATACACC 


TTAATTGTAA 
AATTAACATT 


TACCAAAAAA 
ATGGTTTTTT 


CCAAACCTGG 
GGTTTGGACC 


7 8 01 


ATGCAGAGTT 
TACGTCTCAA 


GGATCAATTA 
CCTAGTTAAT 


TTACAGGGTC 
AATGTCCCAG 


ATTATATAAA 
TAATATATTT 


AGGATATCCC 
TCCTATAGGG 


7851 


AAACAATATA 
TTTGTTATAT 


CATATTTTTT 
GTATAAAAAA 


AGAAGATGGC 
TCTTCTACCG 


AAAGTAAAAG TTTCCAGACC 
TTTCATTTTC AAAGGTCTGG 


79C1 


TGAAGGGGTT 
ACTTCCCCAA 


AAAATTATTC 
TTTTAATAAG 


CCCCTCAGTC 
GGGGAGTCAG 


AGACAGACAA 
TCTGTCTGTT 


AAAATT6TGC 
TTTTAACACG 


7951 


TTCAAGCCCA 
AAGTTCGGGT 


CAATTTGGCT 
GTTAAACCGA 


CACACCGGAC 
GTGTGGCCTG 


GTGAAGCCAC 
CACTTCGGTG 


TCTTTTAAAA 
AGAAAATTTT 


BOOx 


ATT6CCAACC 
TAACGGTTGG 


TTTATTGGTG 
AAATAACCAC 


GCCAAATATG 
•C6GTTTATAC 


AGAAAGGATG 
TCTTTCCTAC 


TGGTTAAACA 
ACCAATTTGT 


8051 


ACTAGGACGC 
TGATCCTGCG 


TGTCAACAGT 
ACAGTTGTCA 


GTTTAATCAC 
CAAATTAGTG 


AAATGCTTCC 
TTTACGAAGG 


AACAAAGCCT 
TTGTTTCGGA 


8101 


CTGGTCCTAT 
GACCAGGATA 


TCTAAGACCA 
AGATTCTGGT 


GATAGGCCTC 
CTATCCGGAG 


AAAAACCTTT 
TTTTTGGAAA 


TGATAAATTC 
ACTATTTAAG 


8151 


TTTATTGACT 
AAATA.IVCTGA 


ATATTGGACC 
TATAACCTGG 


TTTGCCACCT 
AAACGGTGGA 


TCACAGGGAT 
AGTGTCCCTA 


ACCTATATGT 
TGGATATACA 


8201 


ATTAGTAGTT 
TAATCATCAA 


GTTGATGGAA 
CAACTACCTT 


TGACAGGATT 
ACTGTCCTAA 


CACTTGGTTA 
GTGAACCAAT 


TACCCCACTA 
Al'GGGGTGAT 


8251 


•AGGCTCCTTC 
TCCGAGGAAG 


TACTAGCGCA 
ATGATCGCGT 


ACTGTTAAAT 
TGACAATTTA 


CTCTCAATGT 
GAGAGTTACA- 


ACTCACTAGT 
TGAGTGATCA 


8301 


ATTGCAATTC 
TAACGTTAAG 


CAAAGGTGAT TCACTCTGAT 
GTTTCCACTA AGTGAGACTA 


CAAGGTGCAG 
6TT.CCACGTC 


CATTCACTTC 
GTAAGTGAAG 


8 3 51 


TTCAAC CTTT 
AAGTTGGAAA 


GCTGAATGGG 
CGACTTACCC 


CAAAGGAAAG 
GTTTCCTTTC 


AGGTATACAT 
TCCATATGTA 


TTGGAATTCA 
AACCTTAAGT 


8401 


GTACTCCTTA 
CATGAGGAAT 


TCACCCCCAA 
AGTGGGGGTT 


AGTGGTAGTA ' 
TCACCATCAT 


AGGTGGAAAG 
TCCACCTTTC 


GAAAAATAGT 
CTTTTTATCA 


8451 


GATATAAAAC 
CTATATTTTG 


GACTTTTAAC 
CTGAAAATTG 


TAAACTGCTA 
ATTTGACGAT 


GTAGGAAGAC* 
CATCCTTCTG 


CCACAAAGTG 
GGTGTTTCAC 


8501 


GTATGACCTA 
CATACTGGAT , 


TTGCCTGTTG ' 
AACGGACAAC . 


TACAACTTGC ' 
ATGTTGAACG . 


TTTAAACAAC 
AAATTTGTTG 


ACCTATAGCC 
TGGATATCGG 
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B551 


CTGTATTAAA 
GACATAATTT 


ATATACTCCA 
TATATGAGGT 


CATCAACTCT 
GTAGTTGAGA 


.TATTTGGTAT 
ATAAACCATA 


AGATTCAAAT 
TCTAAGTTTA 


8601 


ACTCCATTTG 
TGAGGTAAAC 


CAAATCAAGA 
GTTTAGTTCT 


TACACTTGAC 
ATGTGAACTG 


TTGACCAGAG 
AACTGGTCTC 


AAGAAGAACT 
TTCTTCTTGA 


80*51 


TTCTCTTTTA 
AAGAGAAAAT 


CAGGAAATTC 
GTCCTTTAAG 


GTACTTCTTT 
CATGAAGAAA 


ATACCATCCA 
TATGGTAGGT 


TCCACCCCTC 
AGGTGGQGAG 


8 7 01 


CAGCCTCCTC 
GTCGGAGGAG 


TCGTTCCTGG 
AGCAAGGACC 


TCTCCTGTTG 
AGAGGACAAC 


TTGGCCAATT 
AACCGGTTAA 


GGTCCAGGAG 
CCAGGTCCTC 


3751 


AGGGTGGCTA 
TCCCACCGAT 


G6CCTGCTTC 
CC6GACGAAG 


TTTGAGACCT 
AAACTCTGGA 


CGTTGGCATA 
GCAACCGTAT 


AACCGTCTAC 
TTGGCAGATG. 


8801 


TGTACTTAAG 
ACATGAATTC 


GTGTTGAATC 
CACAACTTAG 


CAAGGACTGT 
GTTCCTGACA 


TGTTATTTTG 
ACAATAAAAC 


GACCATCTTG 
CTGGTAGAAC 


8851 


GCAACAACAG 
CGTTGTTGTC 


AACTGTAAGT 
TTGACATTCA 


ATAGATAATT 
TATCTATTAA 


TAAAACCTAC 
ATTTTGGATG 


TTCTCATCAG 
AAGAGTAGTC 


8901 


AATGGCACCA 
TTACCGTGGT 


CCAATGACAC 
GGTTAXTGTG 


TGCAACAATG 
ACGTTGTTAC 


GATCATTTGG 
CTAGTAAACC 


AAAAAAATGA 
TTTTTTTACT 


8951 


ATAAAGCGCA 
TATTTCGCGT 


TGAGGCACTT 
ACTCCGTGAA 


CAAAATACAA 
GTTTTATGTT 


CAACTGTGAC 
GTTGACACTG 


TGAACAGCAG 
ACTTGTCGTC 


9001 


AAGGAACAAA 
TTCCT-TGTTT 


TTATACTGGA 
AATATGACCT 


CATTCAAAA-T 
GTAAGTTTTA 


GAAGAAGTAC 
CTTCTTCATG 


AACCAACTAG 
TTGGTTGATC 


9051 


GA.GAGATAAA 
CTCTCTATTT 


TTTAGATATC 
AAATCTATAG 


TGCTTTATAC 
ACGAAATATG 


TTGTTGTGCT 
AACAACACGA" 


ACTAGCTCAA 
TGATCGAGTT 


9101 


GAGTATTGGC 
CTCATAACCG 


CTGGATGTTT 
GACCTACAAA 


TTAGTTTGTA 
AATCAAACAT 


TATTGTTAAT 
ATAACAATTA 


CATTGTTTTG 
GTAACAAAAC 


9151 


GTTTCATGCT 
CAAAGTACGA 


TTGTGACTAT 
AACACTGATA 


ATCCAGAATA 
TAGGTCTTAT 


CAATGGAATA 
GTTACCTTAT 


AGGATATTCA 
TCCTATAAGT 


9201 


GGTAT-TAGGA. 
CCA.TAATCCT 


CCTGTAATAG 
GGACATTATC 


ACTGGAATGT 
TGA-CCTTACA 


TACTCAAAGA 
ATGAGTTTCT 


GCTGTTTATC 
CGACAAATAG 


9251 


AACCCTTACA 
TTGGGAATGT 


GACTAGAAGG 
CTGATCTTCC 


ATTGCACGTT 
TAACGTGCAA 


CCCTTAGAAT 
GGGAATCTTA 


GCAGCATCCT 
CGTCGTAGGA 


9301 


GTT.CCAAAAT 
CAAGGTTTTA 


ATGTGGAGGT 
TACACCTCCA 


AAATATGACT 
TTTATACTGA 


AGTATTCCAC 
TCATAAGGTG 


AAGGTGTATA 
TTCCACATAT 


9351 


CTATGAACCC 
GATACTT6GG 


CATCCGGAAC 
GTA6GCCTTG 


CCATAGTGGT 
GGTATCACCA 


GAAGGAGAGG 
CTTCCTCTCC 


GTCCTAGGTC 
CAGGATCCAG 


9401 


TTTCTCAAAT 
AAAGAGTTTA 


TCTGATGATT 
AGACTACTAA 


AATTCAGAAA 
TTAAGTCTTT 


ACATTGCTAA 
TGTAACGATX 


TAATGCTAAT 
ATTACGATTA 


9451 


TTGACACAAG 
AACTGTGTTC 


AAGTAAAGAA 
TTCATTTCTT 


GTTGTTAACT 
CAACAATTGA 


GAAATGGTTA 
CTTTACCAAT 


ATGAAGAAAT 
TACTTCTTTA 
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9 5 01 GCAAAGTTTG TCAGATGTAA TGATTGACTT TGAAATTCCT TTAGGAGACC 
CGTTTCAAAC AGTCTACATT ACTAACTGAA ACTTTAAGGA AATCCTCTGG 



9551 


CTCGTGATCA 
GAGCACTAGT 


AGAACAATAT 
TCTTGTTATA 


ATACATAGAA 
TATGTATCTT 


AATGCTATCA 
TTACGATAGT 


AGAATTTGCA 
TCTTAAACGT 


9601 


AAT T GT T ATT 
TTAACAATAA 


TAGTAAAATA 
ATCATTTTAT 


TAAAGAACCC 
ATTTCTTGGG 


AAACCGTGGC 
TTTGGCACCG 


CTAAGGAGGG 
GATTCCTCCC 


9651 


CCTTATAGCT 
GGAATATCGA 


GATCAATGCC 
CTAGTTACGG 


CATTACCAGG 
GTAATGGTCC 


TTACCATGCT 
AATGGTACGA 


GGATTAACCT 
CCTAATTGGA 


9701 


ATAATAGACA 
TATTATCTGT 


GTCTATTTGG 
CAGATAAACC 


GATTACTATA 
CTAATGATAT 


TTAAAGTGGA 
AATTTCACCT 


GAGTATTAGA 
CTCATAATCT 


9751 


CCTGCAAATT 
GGACGTTTAA 


GGACAACAAA 
CCT6TT6TTT 


GAGTAAATAT 
CTCATTTATA 


GGACAAGCTA 
CCTGTTCGAT 


GACTAGGAAG 
CTGATCCTTC 


9801 


TTTTTATATT 
AAAAATATAA 


CCTAGCAGCC 
GGATCGTCGG 


TGAGACAAAT 
ACTCTGTTTA 


CAATGTTAGT 
GTTACAATCA 


CATGTACTAT 
GTACATGATA 


9851 


TCTGTAGTGA 
AG AC AT C ACT 


TCAATTATAT 
AGTTAATATA 


TCTAAATGGT 
AGATTTACCA 


ATAATATAGA 
TATTATATCT 


AAATACCATA 
TTTATGGTAT 


9901 


GAACAAAACG 
CTTGTTTTGC 


AGCGGTTTCT 
TCGCCAAAGA 


GCTTAATAAA 
CGAATTATTT 


CTAAATAACC 
GATTTATTGG 


TTACATCTGG 
AATGTAGACC 


9951 


AACCTCAGTA 
TTGGAGTCAT 


TTGAAGAAAA 
AACTTCTTTT 


GAGCTCTTCC 
CTCGAGAAGG 


GAAGGATTGG 
-CTTCCTAACC 


AGTTCTCAAG 
TCAAGAGTTC 


loobi 


OTAAAAATGC 
CATTTTTACG 


TCT6TTTAGA 
AGACAAATCT 


GAAATCAATG 
CTTTAGTTAC 


TGTTAGATAT. 
ACAATCTATA 


CTGCAGTAAA 
GACGTCATTT 


10051 


CCTGAATCTG 
GGACTTAGAC 


TAATACTATT 
ATTATGATAA 


GAATACTTCA 
CTTATGAAGT 


TACTATTCCT 
ATGATAAGGA 


TCTCTTTATG 
AGAGAAATAC 


, 10101 


GGAAGGAGAT 
CCTTCCTCTA 


TGTAATTTTA 
ACATTAAAAT 


CTAAAGATAT 
GATTTCTATA 


GATTTCTCAG 
CTAAAGAGTC 


TTGGTTCCAG 
AACCAAGGTC 


10151 


AATGTGATGG 
TTACACTACC ' 


ATTTTATAAC 
TAAAATATTG 


AATTCTAAGT 
TTAAGATTCA 


GGATGCATAT 
CCTAC6TATA 


GCATCCATAT 
CGTAGGTATA 


10201 


GCTTGTAGAT 
CGAACATCTA 


TCTGGAGAAG 
AGACCTCTTC 


TAAGAAGAAT 
ATTCTTCTTA 


GAAAAAGAAG 
CTTTTTCTTC 


AAACTAAATG 
TTTGATTTAC 


102 51 


TAGAGATGGG 
ATCTCTACCC 


GAAACTAAGA 
CTTTGATTCT 


GATGTCTGTA 
CT AC AG A CAT 


TTATCCTTTA 
AATAGGAAAT 


TGGGACAGTC 
ACCCTGTCAG 


10301 


CCGAATCTAC 
GGCTTAGATG 


ATATGATTTT 
TATACTAAAA 


GGTTATTTAG 
CCAATAAATC 


CATACCAAAA 
GTATGGTTTT 


GAATTTTCCT 
CTTAAAAGGA 


10351 • 


TCCCCTATCT 
AGGGGATAGA 


GTATAGAACA 
CATATCTTGT 


ACAGAAAATT 
TGTCTTTTAA 


AGAGATCAAG 
TCTCTAGTTC 


ATTATGAAGT 
TAATACTTCA 


10401 


CTATTCTTTG 
GATAAGAAAC 


TATCAAGAAC 
ATAGTTCTTG 


GCAAAATAGC 
CGTTTTATCG 


TTCTAAAGCA 
AAGATTTCGT 


TATGGAATTG 
ATACCTTAAC 
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104 51 ATACAGTTTT ATTCTCTCTA AAGAATTTTC TTAATTATAC AGGAACTCCT 
TATGTCAAAA TAAGAGAGAT TTCTTAAAAG AATTAATATG TCCTT6AGGA 



10 501 GTAAATGAAA TGCCTAATGC AAGAGCTTTT GTAGGCCTAA TAGATCCCAA 
CATTTACTTT ACGGATTACG TTCTCGAA^LA CATCCGGATT ATCTAGGGTT 



10 5 51 GTTTCCTCCT TCCTATCCCA ATGTTACTAG GGAACATTAT ACTTCCTGTA 
CAAAGGAGGA AGGATAGGGT TACAATGATC CCTTGTAATA TGAAGGACAT 



10 6 01 ATAATAGGAA AAGAAGAAGT GTTGATAATA ACTATGCTAA GTTAAGGTCT 
TATTATCCTT TTCTTCTTCA CAACTATTAT TGATACGATT CAATTCCAGA 



10 651 ATGGGGTATG CACTTACAGG AGCAGTGCAA ACCTTATCTC AAATATCAGA 
TACCCCATAC GTGAATGTCC TCGTCACGTT TGGAATAGAG TTTATAGTCT 



10 7 01 TATTAATGAT GAAAACTTAC AGCAAGGAAT ATATTTATTA AGGGATCATG 
ATAATTACTA CTTTTGAATG TCGTTCCTTA TATAAATAAT TCCCTAGTAC 



10 751 TAATAACCTT AATGGAAGCT ACATTGCATG ATATATCTGT TATGGAAGGA 
ATTATTGGAA TTACCTTCGA TGTAACGTAC TATATAGACA ATACCTTCCT 



10 8 01 ATGTTTGCTG TACAACATTT GCATACACAT TTGAATCATT TGAAGACAAT 
TACAAACGAC A1*GTTGTAAA CGTATGTGTA AACTTAGTAA ACTTCTGTTA 



10 8 51 GCTTCTAGAA AGAAGAATAG ACTGGACCTA TATGTCTAGT ACTTGGCTAC 
CGAAGATCTT TCTTCTTATC TGACCTGGAT ATACAGATCA TGAACCGATG 



10 901 AACAACAATT ACAGAAATCT GATGATGAGA TGAAAGTAAT AAAGAGAATT 
TTGTTGTTAA TGTCTTTAGA CTACTACTCT ACTTTCATTA TTTCTCTTAA 



109 51 GCTAGAAGTT TGGTATATTA TGTTAAACAA ACCCATA6TT CTCCCACAGC 
CGATCTTCAA ACCATATAAT ACAATTTGTT T6GGTATCAA GAGGGTGTCG 



110 01 TACAGCCTGG GAGATTGGAT TATATTATGA ATTGGTTATA CCTAAACATA 
ATGTCGGACC CTCTAACCTA ATATAATACT TAACCAATAT GGATTTGTAT 



11051 TTTACTTGAA TAATTGGAAT GTTGTCAATA TAGGTCACTT AGTTAAATCA 
AAATGAACTT ATTAACCTTA CAACAGTTAT ATCCAGTGAA TCAATTTAGT 



11101 GCTGGACAAT TGACTCATGT AACTATAGCT CATCCTTATG AAATAATCAA 
CGACCTGTTA ACTCAGTACA TTGATATCGA GTAGGAATAC TTTATTAGTT 



11151 TAAGGAATGT GTAGAGACTA TATATCTGCA TCTTGAGGAC TGCACAAGAC 
ATTCCTTACA CATCTCTGAT- ATATAGACGT AGAACTCCTG ACGTGTTCTG 



112 01 AAGATTATGT CATATGTGAT GTGGTAAAGA TAGTGCAGCC TTGTGGCAAT 
TTCTAATACA GTATACACTA CACCATTTCT ATCAC6TCGG AACACCGTTA 



112 51 AGCTCAGACA CGAGTGATTG TCCTGTCTGG GCTGAAGCTG TAAAAGAACC 
TCGAGTCTGT GCTCACTAAC AGGACAGACC CGACTTCGAC ATTTTCTTGG 



113 01 ATTTGTGCAA GTCAATCCTC TGAAAAACGG AAGTTATCTG GTTTTGGCAA 
TAAACACGTT CAGTTAGGAG ACTTTTTGCC TTCAATAGAC CAAAACCGTT 



113 51 GTTCCACAGA CTGTCAGATC CCACCATATG TTCCTAGCAT CGTGACTGTT 
CAAGGTGTCT GACAGTCTAG GGTGGTATAC AAGGATCGTA GCACTGACAA 
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114 01 AATGAAACAA CGTCATGCTT TGGACTGGAC TTTAAAAGGC CACTGGTTGC 
TTACTTTGTT GCAGTACGAA ACCTGACCTG AAATTTTCCG GTGACCAACG 



114 51 GGAAGAAAGA TTGAGCTTTG AGCCACGACT GCCAAATCTA CAACTAAGAT 
CCTTCTTTCT AACTCGAAAC TCGGTGCTGA CGGTTTAGAT GTTGATTCTA 



115 01 TACCACATTT GGTTGGAATT ATTGCAAAAA TCAAAGGGAT AAAAATAGAA 
ATGGTGTAAA CCAACCTTAA TAACGTTTTT AGTTTCCCTA TTTTTATCTT 



115,51 GTCACATCCT CTGGAGAAAG TATAAAAGAG CAGATTGAAA GAGCAAAAGC 
CAGTGTAGGA GACCTCTTTC ATATTTTCTC GTCTAACTTT CTCGTTTTCG 



11601 TGAGCTCCTT CGACTGGACA TTCACGAGGG AGATACTCCT GCCTGGATAC 
ACTCGAGGAA GCT6ACCTGT AAGTGCTCCC TCTATGAGGA CGGACCTATG 



116 51 AACAGCTAGC TGCAGCAACA AAGGACGTCT GGCCAGCAGC AGCTTCTGCT 
TTGTCGATCG ACGTCGTTGT TTCCTGCAGA CCGGTCGTCG TCGAAGACGA 



117 01 CTACAAGGAA TTGGTAACTT TTTATCTGGG ACTGCCCAAG GAATATTTGG 
GATGTTCCTT AACCATTGAA AAATAGACCC TGACGGGTTC CTTATAAACC 



11751 AACTGCCTTT AGTCTCTTGG GATACTTAAA GCCTATCCTA ATAGGAGTAG 
TTGACGGAAA TCAGAGAACC. CTATGAATTT CGGATAGGAT TATCCTCATC 



118 01 GGGTCATTCT CTTGGTTATT CTTATATTTA AGATTGTATC ATGGATTCCT 
CCCAGTAAGA GAACCAATAA GAATATAAAT TCTAACATAG TACCTAAGGA 



118 51 ACGAAAAAGA AGAATCAGTA GCCTCCACCT CTGGAATTCA AGACCTGCAG 
TGCTTTTTCT TCTTAGTCAT CGGAGGTGGA GACCTTAAGT TCTGGACGTC 



11901 ACTCTGAGTG AGCTTGTTGG TCCTGAAAAT GCCGGAGAGG GAGAGGTGAC 
TGAGACTCAC TCGAACAACC AGGACTTTTA CGGCCTCTCC CTCTCGACTG 



119 51 TATTGCTGAG GAACCTGAAG AAAATCCTCG ACGCCCCAGA CGATATACTA 
ATAACGACTC CTTGGACTTC TTTTAGGAGC TGCGG6GTCT GCTATATGAT 



12 001 AAAGAGAAGT CAAATGTGTG TCTTATCATG CATATAAAGA AATTGAGGAC 
TTTCTCTTCA GTTTACACAC AGAATAGTAC GTATATTTCT TTAACTCCTG 



12 0 51 AAACATCCTC AACATATTAZV ACTGCAGGAT TGGATCCCCA CACCAGAGGA 
TTTGTAGGAG TTGTATAATT TGACGTCCTA" ACCTAGGGGT GTGGTCTCCT 



12101 AATGAGTAAG TCACTCTGTA AAAGACTTAT TTTATGTGGA TTGTATAGTG 
TTACTCATTC AGTGAGACAT TTTCTGAATA AAATACACCT AACATATCAC 



12151 CAGAAAAGGC CTCAGAGATT TTAAGGATGC CTTTTACAGT TTCTTGGGAA 
GTCTTTTCCG GAGTCTCTAA AATTCCTACG GAAAATGTCA AAGAACCCTT 



122 01 CAATCAGATA CTGACCCTGA CTGTTTTATT GTAAGCTATA CATGTATATT 
GTTAGTCTAX GACTGGGACT GACAAAATAA CATTCGATAT GTACATATAA 



12251 TTGTGATGCT GTAATACATG ATCCCATGCC CATAAGATGG GATCCTGAAG 
AACACTACGA CATTATGTAC TAGGGTX'CGG GTATTCTACC CTAGGACTTC 



123 01 TTGGAATTTG GGTAAAATAT AAACCCCTCA GAGGAATTGT TGGATCTGCT 
AACCTTAAAC CCATTTTATA TTTGGGGAGT ctccttaaca acctagacga 
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12 3 51 GTGTTTATTA TGCATAAACA TCAAAGAAAG TGTTCTCTTG TTAAACCTTC 
CACAAATAAT ACGTATTTGT AGTTTCTTTQ ACAAGAGAAC AATTTGGAAG 



124 01 TACCAGTCGC TCAGAAGGTC CAAAACCAAG ACCTAG6CAC GATCCTGTCC 
ATGGTCAGCG AGTCTTCCAG GTTTTGGTTC TGGATCCGTG CTAGGACAGG 



12451 TTCGATGTGA CATGTTTGAA AAGCATCACA AGCCTCGGCA GAAACGACCC 
AAGCTACACT GTACAAACTT TTCGTAGTGT TCGGAGCCGT CTTTGCTGGG 



12 501 AGGAGACGAT CCATCGATAA TGAGTCATGT GCTTCCAGTA GTGACACCAT 
TCCTCTGCTA GGTAGCTATT ACTCAGTACA CGAAGGTCAT CACTGTGGTA 



. 125 51 GGCCAATGAG CCAGGATCAC TATGCACCAA CCCTCTTTGG AATCCTGGAC 
CCGGTTACTC 6GTCCTAGTG ATACGTGGTT GGGAGAAACC TTAGGACCTG 



12 601 CGCTACTATC AGGGCTACTT GAAGAGTCCA GCAACCTACC AAACTTGGAA 
GCGATGATAG TCCC6ATGAA CTTCTCAGGT CGTTG6ATGG TTTGAACCTT 



12 6 51 GTTCACATGT CAGGTGGACC CTTCTGGGAA GAGGTTTATG GGGACTCAAT 
CAAGTGTACA GTCCACCTGG GAAGACCCTT CTCCAAATAC CCCTGAGTTA . 



12701 TTTGGGTCCC CCCTCTGGGT CAGGTGAACA TTCAGTTTTA TAAGAATTAT 
AAACCCAGGG GGGAGACCCA GTCCACTTGT AAGTCAAAAT ATTCTTAATA 



12 7 51 CAGATTCTAA CTTGCTGTCA GGCTGTAGAT CTTCTTAAGC TTGCGGGAGA 
GTCTAAGATT GAACGACAGT CCGACATCTA GAAGAATTCG AACGCCCTCT 



ApaX 



12 8 01 CGTCGAGTCC AACCCTG6GC CCGATATCCC CATGGTGAGC AAGGGCGAGG 
GCAGCTCAGG TTGGGACCCG GGCTATAGGG GTACCACT.CG TTCCCGCTCC 



12851 AGCTGTTCAC CGGGGTGGTG CCCATCCTGG TCGAGCTGGA CGGCGACGTA • 
TCGACAAGT6 GCCCCACCAC GGGTAGGACC AGCTC6ACCT GCCGCTGCAT 



1^901 AiVCGGCCACA AGTTCAGCGT GTCCGGCGAG GGCGAGGGCG ATGCCACCTA 
TTGCCGGTGT TCAAGTCGCA CAGGCCGCTC CCGCTCCCOC TACGGTGGAT 



12 951 CGGCAAGCTG ACCCTGAAGT TCATCTGCAC CACCGGCAAG CTGCCCGTGC 
GCCGTTCGAC TGGGACTTCA AGTAGACGTG GTGGCCGTTC GACGGGCACG 



130 01 CCTGGCCCAC CCTCGTGACC ACCTTCACCT ACGGCGTGCA GTGCTTGAGC 
GGACCGGGTG GGAGCACTGG TGGAAGTGGA TGCCGCACGT CACGAAGTCG 



13 051 CGCTACCCCG ACCACATGAA GCAGCACGAC TTCTTCAAGT CCGCCATGCC 
GCGATGGGGC TGGTGTACTT CGTCGTGCTG AAGAAGTTCA. GGCGGTACGG 



1310-1 CGAAGGCTAC GTCCAGGAGC GCACCATCTT CTTCAAGGAd GACGGCAACT 
GCTTCCGATG CAGGTCCTCG CGTGGTAGAA GAAGTTCCTG CTGCCGTTGA 



13151 ACAAGACCCG CGCCGAGGTG AAGTTCGAGG GC6ACACCCT GGT6AACCGC 
TGTTCTGGGC GCGGCTCCAC TTCAAGCTCC CGCTGTGGGA CCACTTGGCG 



132 01 ATCGAGCTGA AGGGCATCGA CTTCAAGGAG GACGGCAACA TCCTGGGGCA 
TAGCTCGACT TCCCGTAGCT GAAGTTCCTG CTGCCGTTGT AGGACCCCGT 



132 51 CAAGCTGGAG TACAACTACA ACAGCCACAA CGTCTATATC ATGGCCGACA 
GTTCGACCTC ATGTTGATGT TGTCGGTGTT GCAGATATAG TACCGGCTGT 
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13 3 01 AGCAGAAGAA CGGCATCAAG GTGAACTTCA AGATCCGCCA CAACATCGAG 
TCGTCTTCTT GCCGTAGTTC CACTTGAAGT TCTA6GC6GT GTTGTAGCTC 



13 3 51 GACGGCAGCG TGCAGCTCGC CGACCACTAC CAGCAGAACA CCCCCATCGG 
CTGCCGTCGC AC6TCGAGCG GCTGGTGATG GTCGTCTTGT GGGGGTAGCC 



13 4 01 CGACGGCCCC GTGCTGCTGC CCGACAACCA CTACCTGAGC ACCCAGTCCG 
GCTGCCGGGG CACGACGACG GGCTGTTGGT GATGGACTCG TGGGTCAGGC 



13 4 51 CCCTGAGCAA AGACCCCAAC GAGAAGCGCG ATCACATGGT CCTGCTGGAG 
GGGACTCGTT TCTGGGGTTG CTCTTCGCGC TAGTGTACCA GGACGACCTC 



13 5 01 TTCGTGACCG CCGCCGGGAT CACTCTCGGC A^GGACGAGC TGTACAAGTA 
AAGCACTGGC GGCGGCCCTA GTGAGAGCCG TACCTGCTCG ACATGTTCAT 



NotI 



13 5 51 AAGCGGCCGC GACTCTAGGG GATTCGC^^AT AAGTAAGTAA GCTTATGGAC 
TTCGCCGGCG CTGAGATCCC CTAAGCGCTA TTCATTCATT CGAATACCTG 



13 601 CTCAGAGAGG AAGTAACGAG GAGAGGGTGT GGTGGAATGT CACTAGAAAC 
GAGTCTCTCC TTCATTGCTC CTCTCCCACA CCACCTTACA* GTGATCTTTG 



13 6 51 CAGGGAAAAC AAGGAGGAGA GTATTACAGG GAAGGAGGTG AAGAACCTCA 
GTCCCTTTTG TTCCTCCTCT CATAATGTCC CTTCCTCCAC TTCTTGGAGT 



13 7 01 TTACCCAAAT ACTCCTGCTC CTCATAGACG TACCTGGGAT GAGAGACACA 
AATGGGTTTA TGAGGACGAG GAGTATCTGC ATGGACCCTA CTCTCTGTGT 



13 7 51 AGGTTCTTAA ATTGTCCTCA TTCGCTACTC CCTCTGACAT CCAACGCTGG 
TCCAAGAATT TAACAGGAGT AAGCGATGAG GGAGACTGTA GGTTGCGACC 



13 8 01 GCTACTAAAG CATTGCCTTA TGGCTGGAAA GTGGTCACCG AAAGCGGAAA 
CGATGATTTC GTAACGGAAT ACCGACCTTT CACCAGTGGC TTTCGCCTTT. 



1 38 51 * TGATTATACT AGCCGCAGAA AGATCAGAAC ATTGACAGAG ATGACTCAGG 
ACTAATATGA TCGGCGTCTT TCTAGTCTTG TAACTGTCTC TACTGAGTCC 



13 9 01 ATGAAATTAG AAAAAGGTGG GAAAGTGGAT. ATTGTGACCC CTTCATTGAC 
TACTTTAATC TTTTTCCACC CTTTCACCTA TAACACTGGG GAAGTAACTG 



13 951 TCAGGAAGTG ACTCAGATGG ACCCTTCTAA AAGCCACAGA CAGTAAAAAT 
AGTCCTTCAC TGAGTCTACC TGGGAAGATT TTCGGTGTCT GTCATTTTTA 



14 0 01 GTGTTAGCAC TTTATACAAT ATTATATCTG CTTAAGCTAT AGAAGCTTTC 
CACAATCGTG AAATATGTTA TAATATAGAC GAATTCGATA TCTTCGAAAG 



14051 ACATACTCAG TAGCTGTTTC ACAATCAACA AAACAATGAT GATGTAATCA 
TGTATGAGTC ATCGACAAAG TGTTAGTTGT TTTGTTACTA CTACATTAGT 



14101 TAAGGAAGTA GTTTAAATAG GTTAATAAGT TTATTAGTTA TATAGAAAAT 
ATTCCTTCAT CAAATTTATC CAATTATTCA AATAATCAAT ATATCTTTTA 



14151 AATATAGGAT AAAGTATAAG GATTAAGGTA TGAGGTGTGT GGCTCAACAC 
TTATATCCTA TTTCATATTC CTAATTCCAT ACTCCACACA CCGAGTTGTG 



142 01 GTAGGGTGAC AAGAAAATCT ACTGTAATAG GACACAACAC CTCTAAAGTT 
. CATCCCACTG TTCTTTTAGA TGACATTATC CTGTGTTGTG GAGATTTCAA 
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SEQ ID NO: 2 

a tgggtgcgag agcgtcggta ttaagcgggg gagaattaga 
taaatgggaa 

841 aaaattcggt taaggccagg gggaaagaaa caatataaac taaaacatat 
agtatgggca 

901 agcagggagc tagaacgatt cgcagttaat cctggccttt tagagacatc 
agaaggctgt 

961 agacaaatac tgggacagct acaaccatcc cttcagacag gatcagaaga 
acttagatca 

1021 ttatataata caatagcagt cctctattgt gtgcatcaaa ggatagatgt 
aaaagacacc 

1081 aaggaagcct tagataagat agaggaagag caaaacaaaa gtaagaaaaa 
ggcacagcaa 

1141 gcagcagctg acacaggaaa caacagccag gtcagccaaa attaccctat 
agtgcagaac 

1201 ctccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaatgc 
atgggtaaaa 

12 61 gtagtagaag agaaggcttt cagcccagaa gtaataccca tgttttcagc 
attatcagaa 

1321 ggagccaccc cacaagattt aaataccatg ctaaacacag tggggggaca 
tcaagcagcc 

1381 atgcaaatgt taaaagagac catcaatgag gaagctgcag aatgggatag 
attgcatcca 

1441 gtgcatgcag ggcctattgc accaggccag atgagagaac caaggggaag 
tgacatagca 

1501 ggaactacta gtacccttca ggaacaaata ggatggatga cacataatcc 
acctatccca 

1561 gtaggagaaa tctataaaag atggataatc ctgggattaa ataaaatagt 
aagaatgtat 

1621 agccctacca gcattctgga cataagacaa ggaccaaagg aaccctttag 
agactatgta 

1681 gaccgattct ataaaactct aagagccgag caagcttcac aagaggtaaa 
aaattggatg 

1741 acagaaacct tgttggtcca aaatgcgaac ccagattgta agactatttt 
aaaagcattg 

1801 ggaccaggag cgacactaga agaaatgatg acagcatgtc agggagtggg 
gggacccggc 

1861 cataaagcaa gagttttggc tgaagcaatg agccaagtaa caaatccagc 
taccataatg 

1921 atacagaaag gcaattttag gaaccaaaga aagactgtta agtgtttcaa 
ttgtggcaaa 

1981 gaagggcaca tagccaaaaa ttgcagggcc cctaggaaaa agggctgttg 
gaaatgtgga 

2041 aaggaaggac accaaatgaa agattgtact gagagacagg ctaatttttt 
agggaagatc 

2101 tggccttccc acaagggaag gccagggaat tttcttcaga gcagaccaga 
gccaacagcc 

2161 ccaccagaag agagcttcag gtttggggaa gagacaacaa ctccctctca 
gaagcaggag 

2221 ccgatagaca aggaactgta tcctttagct tccctcagat cactctttgg 
cagcgacccc 
2281 tcgtcacaat aa 



SEQ ID NO: 3 



MGARASVI^GGELDKWEKIEILRPGGKKQYKLKHIWASRELE^ 
TSEGCRQILGQLQPSLQTGSEEIJISLYNTIAVLYC\^QRIDVKDT^ 
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QNK5KKKAQQAAADTGNNSQVSQNYPIVQNLQGQMVHQAISPRTLNAWVKV 

X^EKLAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAE 

WDRLHPVHAGPIAPGQMREPRGSDIAGTTSTIX^EQIGWMTHNPPIPVGEIYKR 

WnLGLNKIVRMYSFrsnJDIRQGPKEPFRDYVDRF^TLR^ 

.ETLLVQNA2SIPDCKTILKALGPGATLEEMMTACQGVGGPGHKARVLAEAMSQ 

VTNPATIMIQKGNFRNQRKTVKCFNCGKEGHIAKNCRAPRKKG^ 

QMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAPPEESFRFGEETTTPSQK 

QEPIDKBLYPLASLRSLFGSDPSSQ 



SEQ ID NO: 4 

MGARASVl^GGELDRWEKIRLRPGGKKKYKLKHIWASRELERFAVNIPGL^^ 

TSEGCRQE.GQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEAI.DKIEEE 

QNKSKKKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNAWVKV 

VEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETESIEEAAE 

WDRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNl^PIPVGEIYKR 

Win:.GLNK[VR]!^SPTSILDIRQGPK:EPFRDYVDREfYKT^ 

ETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLAEAMSQ 

VTNSATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKGCWCGKEG 

HQMKDCTERQAJSIFLGKrWPSYKGRPGNFLQSRPEPTAPPFLQSRPEPTAPPEES 

ERSGVETTTPSQKQEPIDKELYPLtSLRSLFGlSIDPSSQ 



SEQ ID NO: 5 

MGARASVLSGGELDRWEKVRLRPGGKKKYKLKHTVWASRELERFAVNPGLLE 

TSEGCRQILGQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTkEALDKIEEE 

QNKSKKKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNA\\^VKV 

VEEKAFSPEVn>MFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETESnEEAAE 

WDRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKR 

WELGLNKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWM 

ETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLAEAMSQ 

VTNSATIMMQRGNFRNQRKIVKCFNCGKEGHIAIWCRAPRKKGCWKCGKEG 

HQMKDCTERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQSRPEPTAPPEES 

FRSGVETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 



SEQ ID NO: 6 

TCC GGG CCC GGA ATG CCT ATA GTC CAG AAC ATC C 



SEQ ID NO: 7 

GCG GCC GCG TTT TGA GAA CGA AAT ACC GG 



SEQ ID NO: 8 
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wo 2004/003153 ^WCT/US2003/020325 



SEQ ID NO:l with SEQ ID NO: 2 inserted between 12816 and 13552. 
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